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METHODS AND COMPOSITIONS FOR MODIFYING APOLIPOPROTEIN B 

mRNA EDITING 

This application claims the benefit of U.S. Provisional Patent 
5 Application Serial No. 60/271,856, filed February 27, 2001, which is hereby 
incorporated by reference in its entirety. 

This invention was made, at least in part, using funding received from 
the U.S. Public Health Service, grant DK43739. The U.S. government may have 
certain rights in this invention. 

10 

FIELD OF THE INVENTION 

The present invention related generally to the chimeric proteins, 
compositions and products containing one or more chimeric proteins, as well as the 
1 5 use thereof to modify apolipoprotein B processing, to treat or prevent atherogenic 
diseases or disorders, and to modify the intravascular lipoprotein population. 

BACKGROUND OF THE INVENTION 

20 Cholesterol is carried in blood by specific carrier proteins called 

apolipoproteins and from one tissue to another as lipoprotein particles, Apolipoprotein 
B is an integral and non-exchangeable structural component of lipoprotein particles 
referred to as chylomicrons, very low density lipoprotein ("VLDL"), and low density 
lipoprotein ("LDL"). Apolipoprotein B circulates in human plasma as two isoforms, 

25 apolipoprotein B100 and apolipoprotein B48. Apolipoprotein B48 is generated by an 
RNA editing mechanism which changes codon 2153 (CAA) to a translation stop codon 
(UAA) (Chen et al., "Apolipoprotein B-48 is the product of a messenger RNA with an 
organ-specific in-frame stop codon," Science 238:363-366 (1987); Powell et al., "A 
novel form of tissue-specific RNA processing produces apolipoprotein-B48 in 

30 intestine," Cell 50:831-840 (1987)). Editing is a site-specific deamination event 

catalyzed by apolipoprotein B mRNA editing catalytic subunit 1 (known as APOBEC- 
1) (Teng et al., "Molecular cloning of an apo B messenger RNA editing protein," 
Science 260:181 16-1819 (1993)) with the help of auxiliary factors (Teng et al., 
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"Molecular cloning of an apo B messenger RNA editing protein," Science 260: 18116- 
1819 (1 993); Yang et al., "Partial characterization of the auxiliary factors involved in 
apo B mRNA editing through APOBEC-1 affinity chromatography," J. Biol. Chem. 
272:27700-27706 (1997), Yang et al, "Multiple protein domains determine the cell 
5 type-specific nuclear distribution of the catalytic subunit required for apo B mRNA 
editing," Proc. Natl. Acad. Sci. USA 94:13075-13080 (1997); Lellek et al., 
"Purification and Molecular cloning of a novel essential component of the apo B 
mRNA editing enzyme complex," J. Biol. Chem. 275: 19848-19856 (2000), Mehta et 
al., "Molecular cloning of apobec-1 complementation factor, a novel RNA-binding 

1 0 protein involved in the editing of apolipoprotein B mRNA," Mol. Cell. Biol. 20: 1 846- 
1854 (2000); Yang et al., "Induction of cytidine to uridine editing on cytoplasmic 
apolipoprotein B mRNA by overexpressing APOBEC-1," J. Biol. Chem. 275:22663- 
22669 (2000); Blanc et al., "Identification of GRY-RBP as an apoB mRNA binding 
protein that interacts with both apobec-1 and with apobec-1 complementation factor 

15 (ACF) to modulate C to U editing," J. Biol. Chem. 276: 10272-10283 (2001)) as a 
holoenzyme or editosome (Smith et al. "In vitro apolipoprotein B mRNA editing: 
Identification of a 27S editing complex," Proc. Natl. Acad. Sci. USA 88: 1489-1493 
(1991), Harris et al., "Extract-specific heterogeneity in high-order complexes 
containing apo B mRNA editing activity and RNA-binding proteins," J. Biol. Chem. 

20 268:7382-7392 (1993)). Apolipoprotein B100 and apolipoprotein B48 play different 
roles in lipid metabolism, most importantly, apolipoprotein BlOO-associated 
lipoproteins (VLDL and LDL) are much more atherogenic than apolipoprotein B48- 
associated lipoproteins (chylomicrons and their remnants and VLDL). 

Specifically, the apolipoprotein B48-associated lipoproteins are cleared 

25 from serum more rapidly than the apolipoprotein BlOO-associated lipoproteins. As a 
result, apolipoprotein B48-VLDL usually are not present in serum for an amount of 
time sufficient for serum lipases to convert the VLDL to LDL. In contrast, the 
apolipoprotein B 100- VLDL are present in the serum for sufficient amounts of time, 
allowing serum lipases to convert the VLDL to LDL. Elevated serum levels of LDL 

30 are of particular biomedical significance as they are associated with an increased risk of 
atherogenic diseases or disorders. Lipoprotein analyses have shown that the ability of 
mammalian liver to edit results in a lowering of the VLDL + LDL : HDL ratio. 
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Therefore, it would be desirable to identify an approach for modifying apolipoprotein 
B editing which would favor an increase in the relative concentration of apolipoprotein 
B48 in proportion to apolipoprotein B100 (or total apolipoprotein concentration), 
thereby deal ing a greater concentration of lipoproteins from serum and minimizing the 
5 atherogenic risks associated with high serum levels of VLDL and LDL. 

Current lipid-lowering therapies include statins and bile-acid-binding 
resins. Statins are competitive inhibitors of hydroxymethylglutaryl-coenzyme A 
(HMG-CoA) reductase, which catalyzes the committed step in the synthesis of 
cholesterol (Davignon et al, "HMG-CoA reductase inhibitors: a look back and a look 

10 ahead," Can. J. Cardiol. 8:843-64 (1992)). Bile-acid-binding resins sequester bile acids 
in the intestine, thereby interrupting the enterohepatic circulation of bile acids and 
increasing the elimination of cholesterol from the body. These are effective therapies 
for some patients with hyperlipidemia; however, adverse effects have been observed in 
up to 30% of the patients, suggesting the need for alternative therapies. Mutations in 

1 5 the gene encoding the LDL-receptor or apolipoprotein B can cause a human genetic 
disease known as familial hypercholesterolemia, characterized by an elevated level of 
cholesterol and early atherosclerosis due to the defect in LDL-receptor mediated 
cholesterol uptake by cells (Goldstein et al., Familial hypercholesterolemia," In The 
Metabolic and Molecular Bases of Inherited Disease , Vol. 2., pi 98 1-2030, Scriver et 

20 al. (eds.), McGraw-Hill, New York (1995)). Therapy for children with this disorder is 
needed in order to prevent morbidity or mortality, however the National Cholesterol 
Education Program (NCEP) recommends consideration of drug treatment only for 
children 10 years of age or older due to the risk that prolonged drug therapy may 
impair growth and pubertal development. Developing alternative approaches for 

25 lowering serum LDL levels is therefore essential for the sectors of the population still 
at risk. 

Stimulating hepatic apolipoprotein B mRNA editing is a means of 
reducing serum LDL through the reduction in synthesis and secretion of apolipoprotein 
B 100 containing VLDL. In most mammals (including humans), apolipoprotein B 
30 mRNA editing is carried out only in the small intestine. The presence of substantial 
editing in liver (found in 4 species) is associated with a less atherogenic lipoprotein 
profile compared with animals that do not have liver editing activity (Greeve et al., 
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"Apolipoprotein B mRNA editing in 12 different mammalian species: hepatic 
expression is reflected in low concentrations of apoB-containing plasma lipoproteins," 
J. Lipid Res. 34: 1367-1383 (1993)). APOBEC-1 is expressed in all tissues that cany 
out apolipoprotein B mRNA editing (Teng et al., "Molecular cloning of an apo B 
5 messenger RNA editing protein," Science 260: 18116-1819 (1 993)). Human liver does 
not express APOBEC-1 but it does express sufficient auxiliary proteins to complement 
exogenous APOBEC- 1 in apolipoprotein B mRNA editing in transfected cells (Teng et 
al., "Molecular cloning of an apo B messenger RNA editing protein," Science 
260:181 16-1819 (1993); Sowden et al., "Apolipoprotein B RNA Sequence 3' of the 

10 mooring sequence and cellular sources of auxiliary factors determine the location and 
extent of promiscuous editing," Nucleic Acids Res. 26:1644-1652 (1998)). 

Transgenic experiments aiming to enhance hepatic editing through 
apobec-1 gene transfer have shown a marked lowering of plasma apolipoprotein B100 
and significant reduction of serum LDL (Teng et al., "Adenovirus-mediated gene 

1 5 transfer of rat apolipoprotein B mRNA editing protein in mice virtually eliminates 
apolipoprotein B100 and normal low density lipoprotein production," J. Biol. Chem. 
269:29395-29404 (1994); Hughs et al., "Gene transfer of cytidine deaminase 
APOBEC-1 lowers lipoprotein(a) in transgenic mice and induces apolipoprotein B 
mRNA editing in rabbits," Hum. Gene Ther. 7:39-49 (1996); Nakamuta et al, 

20 "Complete phenotypic characterization of the apobec-1 knockout mice with a wild- 
type genetic background and a human apolipoprotein B transgenic background, and 
restoration of apolipoprotein B mRNA editing by somatic gene transfer of Apobec-1," 
J. Biol. Chem. 271:25981-25988 (1996); Kozarsky et al., "Hepatic expression of the 
catalytic subunit of the apolipoprotein B mRNA editing enzyme ameliorates 

25 hypercholesterolemia in LDL receptor-deficient rabbits," Hum. Gene Ther. 7:943-957 
(1996), Farese et al, "Phenotypic analysis of mice expressing exclusively 
apolipoprotein B48 or apolipoprotein B100," Proc. Natl Acad. Sci. USA 93:6393- 
6398 (1996); Qian et al., "Low expression of the apolipoprotein B mRNA editing 
transgene in mice reduces LDL but does not cause liver dysplasia or tumors," 

30 Arteriosc. Thromb. Vase. Biol. 18:1013-1020 (1998); Wu et al, "Normal perinatal rise 
in serum cholesterol is inhibited by hepatic delivery of adenoviral vector expressing 
apolipoprotein B mRNA editing enzyme in rabbits," J. Surg. Res. 85: 148-1 57 (1 999)). 
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Apolipoprotein B 100 is not essential for life as mice that synthesize exclusively 
apolipoprotein B48 (apolipoprotein B48-only mice) generated through targeted 
mutagenesis developed normally, were healthy and fertile. Compared with wild-type 
mice fed on a chow diet, the level of LDL-cholesterol was lower in apolipoprotein 
5 B48-only mice (Farese et al., "Phenotypic analysis of mice expressing exclusively 
apolipoprotein B48 or apolipoprotein B100," Proc. Natl. Acad. Sci. USA 93:6393- 
6398 (1996)). However, the induction of apolipoprotein B mRNA editing activity 
through apobec-1 gene transfer and tissue-specific overexpression poses a significant 
challenge in that it has induced hepatocellular dysplasia and carcinoma in transgenic 

10 mice and rabbits (Yamanaka et al, "Apolipoprotein B mRNA editing protein induces 
hepatocellular carcinoma and dysplasia in transgenic animals.," Proc. Natl. Acad. ScL 
USA 92: 8483-8487 (1995); Yamanaka et al., "Hyperediting of multiple cytidines of 
apolipoprotein B mRNA by APOBEC-1 requires auxiliary protein(s) but not a mooring 
sequence motif," J. Biol. Chem. 271:1 1506-1 1510 (1996); Yamanaka et al., "A novel 

15 translational repressor mRNA is edited extensively in livers containing tumors caused 
by the transgene expression of the apoB mRNA editing enzyme," Genes & Dev. 
1 1 :32 1-333 (1997)). This was proposed to be due to persistent high levels of 
APOBEC-1 expression resulting in unregulated and nonspecific mRNA editing 
(Sowden et al., "Overexpression of APOBEC-1 results in mooring-sequence- 

20 dependent promiscuous RNA editing," J. Biol. Chem. 271:301 1-3017 (1996); 

Yamanaka et al., "A novel translational repressor mRNA is edited extensively in livers 
containing tumors caused by the transgene expression of the apoB mRNA editing 
enzyme," Genes & Dev. 1 1:321-333 (1997); Sowden et al., "Apolipoprotein B RNA 
Sequence 3' of the mooring sequence and cellular sources of auxiliary factors 

25 determine the location and extent of promiscuous editing," Nucleic Acids Res. 

26: 1644-1652 (1998)). Adverse effects were not observed in transgenic animals with 
low to moderate levels of APOBEC-1 expression (Teng et al., "Adenovirus-mediated 
gene transfer of rat apolipoprotein B mRNA editing protein in mice virtually eliminates 
apolipoprotein B100 and normal low density lipoprotein production," J. Biol. Chem. 

30 269:29395-29404 (1994); Qian et al., "Low expression of the apolipoprotein B mRNA 
editing transgene in mice reduces LDL but does not cause liver dysplasia or tumors," 
Arteriosc. Thromb. Vase. Biol. 18:1013-1020 (1998); Wu et al., "Normal perinatal rise 
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in serum cholesterol is inhibited by hepatic delivery of adenoviral vector expressing 
apolipoprotein B mRNA editing enzyme in rabbits," J. Surg. Res. 85: 148-157 (1999)). 
Despite the limited success of apobec-1 gene therapy in modifying apolipoprotein B 
mRNA editing, such gene therapy poses too great a risk of adverse effects stemming 
5 from either persistent elevated levels of APOBEC-1 expression or problems associated 
with the use of infective transformation vectors (e.g., adenoviral vectors). 

For these reasons, it would be desirable to identify an approach to 
achieve apolipoprotein B mRNA editing, where its induction can be maintained at low 
levels and importantly, achieved in a transient manner. Moreover, it would be 
1 0 desirable to identify an approach to achieve apolipoprotein B mRNA editing which is 
substantially free of the side-effects observed with reported gene therapy approaches. 
The present invention is directed to overcoming these and other deficiencies in the art. 



SUMMARY OF THE INVENTION 

15 

A first aspect of the present invention relates to a chimeric protein 
including: a first polypeptide that includes a protein transduction domain and a second 
polypeptide that includes APOBEC-1 or a fragment thereof which can edit mRNA 
encoding apolipoprotein B. 
20 A second aspect of the present invention relates to a chimeric protein 

including: a first polypeptide that includes a protein transduction domain; and a second 
polypeptide that includes APOBEC-1 Complementation Factor ("ACF") or a fragment 
thereof which can bind to apolipoprotein B mRNA to facilitate editing of the mRNA 
by APOBEC-1. 

25 Third and fourth aspects of the present invention relate to DNA 

molecules which encode one of the chimeric proteins of the present invention. DNA 
constructs, expression vectors, and recombinant host cells including such DNA 
molecules are also disclosed. 

A fifth aspect of the present invention relates to a composition which 

30 includes: a pharmaceutical^ acceptable carrier and a chimeric protein of the present 
invention. 
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A sixth aspect of the present invention relates to a composition which 
includes: a first chimeric protein including a first polypeptide that includes a protein 
transduction domain and a second polypeptide that includes AJPOBEC-1 or a fragment 
thereof which can edit mRNA encoding apolipoprotein B, and a second chimeric 
5 protein including a first polypeptide that includes a protein transduction domain and a 
second polypeptide that includes ACF or a fragment thereof which can bind to 
apolipoprotein B mRNA to facilitate editing of the mRNA by APOBEC-1 or the 
fragment thereof. 

A seventh aspect of the present invention relates to a delivery device 

10 which includes either a chimeric protein of the present invention or a composition of 
the present invention. 

An eighth aspect of the present invention relates to a method of 
modifying apolipoprotein B mRNA editing in vivo which includes: contacting 
apolipoprotein B mRNA in a cell with a chimeric protein including a first polypeptide 

1 5 that includes a protein transduction domain and a second polypeptide that includes 
APOBEC-1 or a fragment thereof which can edit mRNA encoding apolipoprotein B, 
under conditions effective to increase the concentration of apolipoprotein B48 which is 
secreted by the cell as compared to the concentration of apolipoprotein B100 which is 
secreted by the cell, relative to an untreated cell. 

20 A ninth aspect of the present invention relates to a method of reducing 

serum LDL levels which includes: delivering into one or more cells of a patient, 
without genetically modifying the cells, an amount of a protein comprising APOBEC-1 
or a fragment thereof which can edit mRNA encoding apolipoprotein B, which amount 
is effective to increase the concentration of VLDL-apolipoprotein B48 that is secreted 

25 by the one or more cells into serum and, consequently, reduce the serum concentration 
of LDL. 

A tenth aspect of the present invention relates to a method of treating 
or preventing an atherogenic disease or disorder which includes: administering to a 
patient an effective amount of a protein including APOBEC-1 or a fragment thereof 
30 which can edit mRNA encoding apolipoprotein B, wherein upon said administering the 
protein is taken up by one or more cells of the patient that can synthesize and secrete 
VLDL-apolipoprotein B under conditions which are effective to increase the 
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concentration of VLDL-apolipoprotein B48 that is secreted by the one or more cells 
into serum, whereby rapid clearing of VLDL-apolipoprotein B48 from serum 
decreases the serum concentration of LDL to treat or prevent the atherogenic disease 
or disorder. 

5 An eleventh aspect of the present invention relates to a liposome or 

niosome which is targeted for uptake by a liver cell, the liposome or niosome 
containing (i) APOBEC-1 or a fragment thereof which is effective to edit 
apolipoprotein B mRNA, (ii) ACF or a fragment thereof which is effective to bind 
apolipoprotein B mRNA, or (iii) a combination thereof Compositions which include 

10 the liposome or niosome are also disclosed. 

The present invention demonstrates the efficacy of protein-mediated 
delivery to increase intracellular APOBEC-1 in cells which produce and secrete 
VLDL-apolipoprotein B. By increasing the extent of apolipoprotein B mRNA editing 
in vivo, it is possible to modify the ratio of VLDL-apolipoprotein B48 to VLDL- 

15 apolipoprotein B100 which is secreted by such cells, specifically increasing the relative 
serum concentration of VLDL-apolipoprotein B48 and decreasing the relative serum 
concentration of VLDL-apolipoprotein Bl 00. Due to the nature of these complexes, 
the B48 complex is cleared much more rapidly from serum, minimizing the conversion 
of VLDL into LDL, a major atherogenic disease factor. By minimizing the amount of 

20 VLDL-apolipoprotein B100 and increasing the amount of VLDL-apolipoprotein B48, 
it is possible to both treat and prevent atherogenic diseases or disorders. Moreover, by 
using protein delivery, it is possible to avoid the apparently unavoidable side effects of 
gene therapy. These results presented here open new possibilities for the treatment of 
hyperlipidemia through the induction of precisely controlled hepatic editing activity. 

25 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figures 1A-D illustrate the structure (1A) and both nucleotide (1B-C, 
SEQ ID No: 1) and amino acid (ID, SEQ ID No: 2) sequences for an exemplary first 
30 chimeric protein (designated TAT-hAPOBEC-CMPK) specific for human 

apolipoprotein B mRNA editing. In Figures 1B-C, the region encoding human 
APOBEC-1 is shown in lowercase letters and the start codon for this construct is at 
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the beginning of the sequence. The sequences encoding a TAT protein transduction 
domain and a hemagglutinin domain are shown in uppercase letters near the 5' end 
(i.e., upstream of the APOBEC-1 sequence). The sequence encoding CMPK is shown 
3' of the APOBEC-1 sequence in uppercase letters. At the 3' terminal region and 
5 shown in lowercase letters is a sequence encoding a histidine tag. In Figure ID, 

beginning from the N-terminal end, the TAT protein transduction domain is shown in 
bold, followed by the hemagglutinin domain also shown in bold, human APOBEC-1 
shown underlined, CMPK also shown underlined, and the histidine tag shown in bold 
at the C-terminus. 

10 Figures 2A-D illustrate the structure (2 A) and both nucleotide (2B-C, 

SEQ ID No: 3) and amino acid (2D, SEQ ID No: 4) sequences for an exemplary first 
chimeric protein (designated TAT-rAPOBEC-CMPK) specific for rat apolipoprotein B 
mRNA editing. In Figures 2B-C, the region encoding rat APOBEC-1 is shown in 
lowercase letters and the start codon for this construct is at the beginning of the 

1 5 sequence. The sequences encoding a TAT protein transduction domain and a 

hemagglutinin domain are shown in uppercase letters near the 5' end (i.e., upstream of 
the APOBEC-1 sequence). The sequence encoding CMPK is shown 3' of the 
APOBEC-1 sequence in uppercase letters. At the 3' terminal region and shown in 
lowercase letters is a sequence encoding a histidine tag. In Figure 2D, beginning from 

20 the N-terminal end, the TAT protein transduction domain is shown in bold, followed 
by the hemagglutinin domain also shown in bold, rat APOBEC-1 shown underlined, 
CMPK also shown underlined, and the histidine tag shown in bold at the C-terminus. 

Figures 3A-C illustrate the structure (3 A) and both nucleotide (3B, 
SEQ ID No: 5) and amino acid (3C, SEQ ID No: 6) sequences for an exemplary 

25 second chimeric protein (designated TAT-hACF) specific for complementing human 
APOBEC-1 . In Figure 3B, the region encoding human ACF is shown in lowercase 
letters and the start codon for this construct is at the beginning of the sequence. The 
sequence encoding a TAT protein transduction domain and a hemagglutinin domain is 
shown in uppercase letters near the 5' end (i.e., upstream of the ACF sequence). At the 

30 3' terminal region and shown in lowercase letters is a sequence encoding a histidine 
tag. In Figure 3C, beginning from the N-terminal end, the TAT protein transduction 
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domain is shown in bold, followed by the hemagglutinin domain also shown in bold, 
human ACF shown underlined, and the histidine tag shown in bold at the C-terminus. 

Figures 4A-C illustrate the structure (4A) and both nucleotide (4B, 
SEQ ID No: 7) and amino acid (4C, SEQ ID No: 8) sequences for an exemplary 
5 second chimeric protein (designated TAT-rACF) specific for complementing rat 

APOBEC-1. In Figure 4B, the region encoding rat ACF is shown in lowercase letters 
and the start codon for this construct is at the beginning of the sequence. The 
sequence encoding a TAT protein transduction domain and a hemagglutinin domain is 
shown in uppercase letters near the 5* end (i.e., upstream of the ACF sequence). At the 

10 3' terminal region and shown in lowercase letters is a sequence encoding a histidine 
tag. In Figure 4C, beginning from the N-terminal end, the TAT protein transduction 
domain is shown in bold, followed by the hemagglutinin domain also shown in bold, rat 
ACF shown underlined, and the histidine tag shown in bold at the C-terminus. 

Figures 5A-B illustrate the purification of full-length TAT-rAPOBEC- 

1 5 CMPK protein. In Figure 5A, a schematic image illustrates generally the structure of a 
prokaryotic expression vector, pET-24b, encoding the TAT fusion protein. Figure 5B 
illustrates the image of a gel following two-column purification and silver-staining. 
The TAT fusion protein is the only protein recovered in significant concentrations. 

Figures 6A-F are images of immuno-stained cells exposed to the TAT 

20 fusion protein TAT-rAPOBEC-CMPK. McArdle cells were treated with 650 nM of 
recombinant TAT-rAPOBEC-CMPK for the indicated times (lh, 6h, or 24h). Cells 
were fixed, permeabilized, reacted with antibody to the HA epitope and FITC- 
conjugated anti-mouse secondary antibody and mounted in DAPI containing buffer as 
described in the Examples. Arrowheads indicated the position of select nuclei. 

25 Figures 7A-F are images of immuno-stained cell exposed to TAT- 

CMPK fusion protein. McArdle cells were treated with 1 125 nM of recombinant 
TAT-CMPK for the indicated times (lh, 6h, or 24h). Cells were fixed, permeabilized, 
reacted with antibody to the HA epitope and FITC-conjugated anti-mouse secondary 
antibody and mounted in DAPI containing buffer as described in the Examples. 

30 Arrowheads indicated the position of select nuclei. 

Figure 8 is an image of a gel indicating that TAT-CMPK did not 
stimulate editing. McArdle cells were treated with 45 nM, 225 nM and 1 125 nM of 
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recombinant TAT-CMPK for 24 h. Total cellular RNA was isolated and 
apolipoprotein B mRNA was selectively amplified by reverse transcription-polymerase 
chain reaction ("RT-PCR") and the proportion of edited apolipoprotein B RNA 
determined by poisoned primer extension as described in the Examples. CAA, primer 
5 extension product corresponding to unedited RNA; UAA, primer extension product 
corresponding to edited RNA; P, primer. 

Figure 9 is an image of a gel indicating that TAT-rAPOBEC-CMPK 
increased editing activity in McArdle cells. The TAT fusion protein (360 nM or 62 |ig 
protein/ml media) was added into cell culture media and RNAs were isolated 

10 subsequent to treatment from wild type McArdle cells at the indicated time points. 

Control cells were treated with a corresponding aliquot of buffer B used to dialyze the 
recombinant protein. The editing efficiency was calculated as described in the 
Examples. The standard deviations for each of the lanes on the gel, reading left to 
right, are as follows: 0.9, 2.2, 3.8, 2.1, 1.1, 0.9, 0.2, k=3. CAA, primer extension 

1 5 product corresponding to unedited RNA; UAA, primer extension product 
corresponding to edited RNA; P, primer. 

Figure 10 is an image of a gel indicating that TAT fusion protein 
increased editing activity in primary rat hepatocytes. Hepatocytes were prepared and 
treated with TAT-rAPOBEC-CMPK as described in the Examples. Control cells were 

20 treated with a corresponding aliquot of buffer B used to dialyze the recombinant 
protein. The increase in editing activity caused by TAT fusion protein was apparent. 
The standard deviations for each of the lanes on the gel, reading left to right, are as 
follows: 2.2,3.6, 2.5, 1.9, «=3. 

Figure 1 1 is an image showing the changes in secreted lipoprotein 

25 profile due to TAT-rAPOBEC-CMPK treatment. Primary hepatocytes were treated 
with TAT fusion protein first, then labeled with [ 35 S]methionine and [ 33 S]cysteine. 
Control cells (-) were treated with a corresponding aliquot of buffer B used to dialyze 
the recombinant protein. Cell culture media were collected, apolipoprotein B48 and 
apolipoprotein B100 were precipitated by anti-apoB antibody and separated by SDS- 

30 PAGE. The second band below apolipoprotein B48 might have been due to protein 
degradation and the band between apolipoprotein B100 and apolipoprotein B48 could 
be C-3 complement. The editing efficiency of the same cells is shown at the bottom. 
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The results are from a single experiment representative three experiments with similar 
results. 

DETAILED DESCRIPTION OF THE INVENTION 

5 

The present invention relates to protein-mediated approaches for 
regulating apolipoprotein B mRNA editing and, therefore, regulating the relative 
concentration of secreted apolipoprotein B derivatives, which offers an approach for 
controlling the serum levels of atherogenic disease factors such as low density 
lipoproteins ("LDL") which associates with apolipoprotein B and its derivatives. 

According to one aspect of the present invention, a first chimeric 
protein is provided for such uses. The first chimeric protein includes a first 
polypeptide that includes a protein transduction domain and a second polypeptide that 
includes APOBEC-1 or a fragment thereof which can edit mRNA encoding 
apolipoprotein B. 

The first polypeptide can be any protein, or polypeptide fragment 
thereof, which is suitable for inducing cellular uptake of the chimeric protein. 

By way of example, protein transduction domains from several known 
proteins can be employed, including without limitation, HTV-1 Tat protein, Drosophila 
homeotic transcription factor (ANTP), and HSV-1 VP22 transcription factor 
(Schwarze et al., "In vivo protein transduction: Intracellular delivery of biologically 
active proteins, compounds, and DNA," TiPS 21:45-48 (2000), which is hereby 
incorporated by reference in its entirety). 

A preferred protein transduction domain is the protein transduction 
domain of the human immunodeficiency virus ("HIV") tat protein. An exemplary HIV 
tat protein transduction domain has an amino acid sequence of SEQ ID No: 9 as 
follows: 

Arg Lys Lys Arg Arg Gin Arg Arg Arg 
5 

This protein transduction domain has also been noted to be a nuclear translocation 
domain (HTV Sequence Compendium 2000 , Kuiken et al. (eds.), Theoretical Biology 
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and Biophysics Group, Los Alamos National Laboratory, which is hereby incorporated 
by reference in its entirety). One DNA molecule which encodes the HTV tat protein 
transduction domain has a nucleotide sequence of SEQ ID No: 10 as follows: 

5 agaaaaaaaa gaagacaaag aagaaga 27 

Variations of these tat sequences can also be employed. Such sequence variants have 
been reported in HTV Sequence Compendium 2000 , Kuiken et al. (eds.), Theoretical 
Biology and Biophysics Group, Los Alamos National Laboratory, which is hereby 

10 incorporated by reference in its entirety. 

Other cellular uptake polypeptides and their use have been described in 
the literature, including membrane-permeable sequences of the SN50 peptide, the Grb2 
SH2 domain, and integrin p 3 , p l5 and am, cytoplasmic domains (Hawiger, Noninvasive 
intracellular delivery of functional peptides and proteins," Curr. Opin. Chem. Biol. 

15 3 : 89-94 ( 1 999), which is hereby incorporated by reference in its entirety). 

The second polypeptide can be either a full length APOBEC-1 or a 
fragment thereof which includes the catalytic domain thereof. The APOBEC-1 protein 
or fragment thereof is a mammalian APOBEC-1 protein or fragment thereof, including 
without limitation, human, rat, mouse, etc. 

20 The full length human APOBEC-1 has an amino acid sequence 

according to SEQ ID No: 1 1 as follows: 

Met Thr Ser Glu Lys Gly Pro Ser Thr Gly Asp Pro Thr Leu Arg Arg 
15 10 15 

25 

Arg lie Glu Pro Trp Glu Phe Asp Val Phe Tyr Asp Pro Arg Glu Leu 
20 25 30 

Arg Lys Glu Ala Cys Leu Leu Tyr Glu He Lys Trp Gly Met Ser Arg 
30 35 40 45 

Lys He Trp Arg Ser Ser Gly Lys Asn Thr Thr Asn His Val Glu Val 
50 55 60 

35 Asn Phe He Lys Lys Phe Thr Ser Glu Arg Asp Phe His Pro Ser He 
65 70 75 80 

Ser Cys Ser He Thr Trp Phe Leu Ser Trp Ser Pro Cys Trp Glu Cys 
85 90 95 

40 

Ser Gin Ala He Arg Glu Phe Leu Ser Arg His Pro Gly Val Thr Leu 



WO 02/068676 



PCT/US02/05824 



5 



20 



50 



- 14- 



100 105 no 

Val lie Tyr Val Ala Arg Leu Phe Trp His Met Asp Gin Gin Asn Arg 
115 " 120 125 

Gin Gly Leu Arg Asp Leu Val Asn Ser Gly Val Thr He Gin He Met 
130 135 140 



Arg Ala Ser Glu Tyr Tyr His Cys Trp Arg Asn Phe Val Asn Tyr Pro 

10 145 150 155 160 

Pro Gly Asp Glu Ala His Trp Pro Gin Tyr Pro Pro Leu Trp Met Met 
165 170 175 

15 Leu Tyr Ala Leu Glu Leu His Cys He He Leu Ser Leu Pro Pro Cys 
180 185 190 



Leu Lys He Ser Arg Arg Trp Gin Asn His Leu Thr Phe Phe Arg Leu 
195 200 205 

His Leu Gin Asn Cys His Tyr Gin Thr He Pro Pro His lie Leu Leu 
210 215 220 



Ala Thr Gly Leu lie His Pro Ser Val Ala Trp Arg 
25 225 230 235 



This human APOBEC-1 sequence is reported at Genbank Accession No. NP_001635, 
which is hereby incorporated by reference in its entirety. The full length human 
APOBEC-1 is believed to include a putative bipartite nuclear localization signal 
30 between amino acid residues 1 5-34, a catalytic center between amino acid residues 61- 
98, and a putative cytoplasmic retention signal between amino acid residues 173-229. 
A cDNA sequence which encodes the full length human APOBEC-1 is set forth as 
SEQ ID No: 12 as follows: 



35 atgacttctg agaaaggtcc ttcaaccggt gaccccactc tgaggagaag aatcgaaccc 60 

tgggagtttg acgtcttcta tgaccccaga gaacttcgta aagaggcctg tctgctctac 120 

gaaatcaagt ggggcatgag ccggaagatc tggcgaagct caggcaaaaa caccaccaat 180 

cacgtggaag ttaattttat aaaaaaattt acgtcagaaa gagattttca cccatccatc 240 

agctgctcca tcacctggtt cttgtcctgg agtccctgct gggaatgctc ccaggctatt 300 

40 agagagtttc tgagtcggca ccctggtgtg actctagtga tctacgtagc tcggcttttt 360 

tggcacatgg atcaacaaaa tcggcaaggt ctcagggacc ttgttaacag tggagtaact 420 

attcagatta tgagagcatc agagtattat cactgctgga ggaattttgt caactaccca 480 

cctggggatg aagctcactg gccacaatac ccacctctgt ggatgatgtt gtacgcactg 540 

gagctgcact gcataattct aagtcttcca ccctgtttaa agatttcaag aagatggcaa 600 

45 aatcatctta catttttcag acttcatctt caaaactgcc attaccaaac gattccgcca 660 

cacatccttt tagctacagg gctgatacat ccttctgtgg cttggagatg a 711 

The full length rat APOBEC-1 has an amino acid sequence according to 
SEQ ID No: 13 as follows: 



Met Ser Ser Glu Thr Gly Pro Val Ala Val Asp Pro Thr Leu Arg Arg 
1 5 10 15 
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Arg lie Glu Pro His Glu Phe Glu Val Phe Phe Asp Pro Arg Glu Leu 
20 25 30 

5 Arg Lys Glu Thr Cys Leu Leu Tyr Glu He Asn Trp Gly Gly Arg His 
35 40 45 

Ser He Trp Arg His Thr Ser Gin Asn Thr Asn Lys His Val Glu Val 
50 55 60 

10 

Asn Phe He Glu Lys Phe Thr Thr Glu Arg Tyr Phe Cys Pro Asn Thr 
65 70 ~ 75 80 

Arg Cys Ser He Thr Trp Phe Leu Ser Trp Ser Pro Cys Gly Glu Cys 
15 85 ' 90 95 

Ser Arg Ala He Thr Glu Phe Leu Ser Arg Tyr Pro His Val Thr Leu 
100 105 110 

20 Phe lie Tyr He Ala Arg Leu Tyr His His Ala Asp Pro Arg Asn Arg 
115 ~ 120 125 

Gin Gly Leu Arg Asp Leu lie Ser Ser Gly Val Thr lie Gin lie Met 
130 135 140 

25 

Thr Glu Gin Glu Ser Gly Tyr Cys Trp Arg Asn Phe Val Asn Tyr Ser 
145 150 155 160 

Pro Ser Asn Glu Ala His Trp Pro Arg Tyr Pro His Leu Trp Val Arg 
30 165 170 175 

Leu Tyr Val Leu Glu Leu Tyr Cys He lie Leu Gly Leu Pro Pro Cys 
180 185 190 

35 Leu Asn lie Leu Arg Arg Lys Gin Pro Gin Leu Thr Phe Phe Thr lie 
195 200 205 

Ala Leu Gin Ser Cys His Tyr Gin Arg Leu Pro Pro His lie Leu Trp 
210 215 220 

40 

Ala Thr Gly Leu Lys 
225 



45 This rat APOBEC-1 sequence is reported at Genbank Accession No. P38483, which is 
hereby incorporated by reference in its entirety. Recombinant studies using rat 
APOBEC-1 have demonstrated that an N-terminal region, containing the putative 
nuclear localization signal, is required for nuclear distribution of APOBEC-1 while a 
C-terminal region, containing a putative cytoplasmic retention signal (Yang et a]., 

50 "Multiple protein domains determine the cell type-specific nuclear distribution of the 
catalytic subunit required for apolipoprotein B mRNA editing," Proc. Natl Acad. Sci. 
USA 94: 13075-13080 (1997), which is hereby incorporated by reference in its entirety. 
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A cDNA sequence which encodes the full length rat APOBEC-1 is set forth as SEQ ID 
No: 14 as follows: 



atgagttccg 
5 cacgagtttg 
gagatcaact 
cacgttgaag 
agatgctcca 
acagaat ttt 

10 caccacgcag 
atccagatca 
ccttcgaatg 
gaactctact 
cctcaactca 

15 cacatcctgt 



agacaggccc 
aagtcttctt 
ggggaggaag 
tcaatttcat 
ttacctggtt 
tgagccgata 
atcctcgaaa 
tgacggagca 
aagctcattg 
gcatcatttt 
cgtttttcac 
gggccacagg 



tgtagctgtt 
tgacccccgg 
gcacagcatc 
agaaaaatt t 
cctgtcctgg 
cccccatgta 
tcggcaagga 
agagtctggc 
gccaaggtac 
aggacttcca 
gattgctctt 
gttgaaatga 



gaCcccactc 
gaacttcgga 
tggcgacaca 
actacagaaa 
agtccctgtg 
actctgttta 
ctcagggacc 
tactgctgga 
ccccatctgt 
ccctgtttaa 
caaagctgcc 



tgaggagaag 
aagagacctg 
cgagccaaaa 
gatact Cttg 
gggagtgctc 
tttatatagc 
t tattagcag 
ggaattttgt 
gggtgaggct 
atattttaag 
attaccaaag 



aattgagccc 
tctgctgtat 
caccaacaaa 
tccaaacacc 
cagggccatt 
acggctttat 
cggtgt tact 
caactactcc 
gtacgtactg 
aagaaaacaa 
gctaccaccc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

690 



The cDNA molecule is reported at Genbank Accession No. L071 14, which is hereby 
incorporated by reference in its entirety. 
20 The full length mouse APOBEC-1 has an amino acid sequence 

according to SEQ ID No: 15 as follows: 

Met Ser Ser Glu Thr Gly Pro Val Ala Val Asp Pro Thr Leu Arg Arg 
1 5 10 15 

25 

Arg He Glu Pro His Glu Phe Glu Val Phe Phe Asp Pro Arg Glu Leu 
20 25 30 

Arg Lys Glu Thr Cys Leu Leu Tyr Glu He Asn Trp Gly Gly Arg His 
30 35 40 45 

Ser Val Trp Arg His Thr Ser Gin Asn Thr Ser Asn His Val Glu Val 
50 55 60 

35 Asn Phe Leu Glu Lys Phe Thr Thr Glu Arg Tyr Phe Arg Pro Asn Thr 
65 70 75 ~ 80 

Arg Cys Ser He Thr Trp Phe Leu Ser Trp Ser Pro Cys Gly Glu Cys 
85 90 95 

40 

Ser Arg Ala He Thr Glu Phe Leu Ser Arg His Pro Tyr Val Thr Leu 
100 105 110 

Phe He Tyr He Ala Arg Leu Tyr His His Thr Asp Gin Arg Asn Arg 
45 115 120 125 

Gin Gly Leu Arg Asp Leu He Ser Ser Gly Val Thr lie Gin He Met 
130 135 140 

50 Thr Glu Gin Glu Tyr Cys Tyr Cys Trp Arg Asn Phe Val Asn Tyr Pro 
145 150 155 160 

Pro Ser Asn Glu Ala Tyr Trp Pro Arg Tyr Pro His Leu Trp Val Lys 
165 170 175 

55 
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Leu Tyr Val Leu Glu Leu Tyr Cys lie He Leu Gly Leu Pro Pro Cys 
180 185 190 

Leu Lys He Leu Arg Arg Lys Gin Pro Gin Leu Thr Phe Phe Thr He 
5 195 * 200 205 

Thr Leu Gin Thr Cys His Tyr Gin Arg He Pro Pro His Leu Leu Trp 
210 215 220 

10 Ala Thr Gly Leu Lys 
225 



This mouse APOBEC-1 sequence is reported at Genbank Accession No. NP_1 12436, 
1 5 which is hereby incorporated by reference in its entirety. A cDNA sequence which 
encodes the full length mouse APOBEC-1 is set forth as SEQ ID No: 16 as follows: 



atgagttccg agacaggccc tgtagctgtt 

cacgagtttg aagtcttctt tgacccccgg 

20 gagatcaact ggggtggaag gcacagtgtc 

cacgttgaag tcaacttctt agaaaaattt 

agatgctcca ttacctggtt cctgtcctgg 

acagagtttc tgagccgaca cccctatgta 

caccacacgg atcagcgaaa ccgccaagga 

25 atccagatca tgacagagca agagtattgt 

ccttcaaacg aagcttattg gccaaggtac 

gagctctact gcatcatttt aggacttcca 

cctcaactca cgtttttcac aattactctt 

catctccttt gggctacagg gttgaaatga 

30 



gatcccactc tgaggagaag aattgagccc 60 
gagcttcgga aagagacctg tctgctgtat 120 
tggcgacaca cgagccaaaa caccagcaac 180 
actacagaaa gatactttcg tccgaacacc 240 
agtccctgcg gggagtgctc cagggccatt 300 
actctgttta tttacatagc acggctttat 360 
ctcagggacc ttattagcag cggtgtgact 420 
tactgctgga ggaatttcgt caactacccc 480 
ccccatctgt gggtgaaact gtatgtattg 540 
ccctgtttaa aaattttaag aagaaagcaa 600 
caaacctgcc attaccaaag gataccaccc 660 

690 



The cDNA molecule is reported at Genbank Accession No. NM_03 1 1 59, which is 
hereby incorporated by reference in its entirety. 

The first chimeric protein of the present invention can also include one 
or more other polypeptide sequences, including without limitation: (i) a polypeptide 

35 that includes a cytoplasmic localization protein or a fragment thereof which, upon 
cellular uptake of the first chimeric protein, localizes the first chimeric protein to the 
cytoplasm; (ii) a polypeptide that includes a plurality of adjacent histidine residues; and 
(iii) a polypeptide that includes an epitope tag. 

The polypeptide that includes a cytoplasmic localization protein or a 

40 fragment thereof can be any protein, or fragment thereof, which can effectively retain 
the first chimeric protein within the cytoplasm of a cell into which the first chimeric 
protein has been translocated. One such protein is chicken muscle pyruvate kinase 
("CMPK"), which has an amino acid sequence of SEQ ID No: 17 as follows: 
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Met Ser Lys His His Asp Ala Gly Thr Ala Phe lie Gin Thr Gin Gin 
15 10 15 

Leu His Ala Ala Met Ala Asp Thr Phe Leu Glu His Met Cys Arg Leu 
5 20 25 30 

Asp lie Asp Ser Glu Pro Thr lie Ala Arg Asn Thr Gly lie lie Cys 
35 40 45 

10 Thr He Gly Pro Ala Ser Arg Ser Val Asp Lys Leu Lys Glu Met He 
50 55 60 



15 



Lys Ser Gly Met Asn Val Ala Arg Leu Asn Phe Ser His Gly Thr His 
65 ^ 70 75 80 

Glu Tyr His Glu Gly Thr He Lys Asn Val Arg Glu Ala Thr Glu Ser 
85 90 95 



Phe Ala Ser Asp Pro He Thr Tyr Arg Pro Val Ala He Ala Leu Asp 
20 100 105 110 

Thr Lys Gly Pro Glu lie Arg Thr Gly Leu He Lys Gly Ser Gly Thr 
115 120 125 

25 Ala Glu Val Glu Leu Lys Lys Gly Ala Ala Leu Lys Val Thr Leu Asp 
130 135 140 



30 



Asn Ala Phe Met Glu Asn Cys Asp Glu Asn Val Leu Trp Val Asp Tyr 

145 150 155 160 

Lys Asn Leu He Lys Val He Asp Val Gly Ser Lys He Tyr Val Asp 

165 170 175 



Asp Gly Leu lie Ser Leu Leu Val Lys Glu Lys Gly Lys Asp Phe Val 
35 180 185 190 

Met Thr Glu Val Glu Asn Gly Gly Met Leu Gly Ser Lys Lys Gly Val 
195 200 205 

40 Asn Leu Pro Gly Ala Ala Val Asp Leu Pro Ala Val Ser Glu Lys Asp 
210 215 220 



45 



He Gin Asp Leu Lys Phe Gly Val Glu Gin Asn Val Asp Met Val Phe 

225 230 235 240 

Ala Ser Phe He Arg Lys Ala Ala Asp Val His Ala Val Arg Lys Val 

245 250 255 



Leu Gly Glu Lys Gly Lys His He Lys He He Ser Lys He Glu Asn 
50 260 265 270 

His Glu Gly Val Arg Arg Phe Asp Glu He Met Glu Ala Ser Asp Gly 
275 280 285 

55 He Met Val Ala Arg Gly Asp Leu Gly He Glu lie Pro Ala Glu Lys 
290 295 300 



60 



Val Phe Leu Ala Gin Lys Met Met He Gly Arg Cys Asn Arg Ala Gly 

305 310 315 320 

Lys Pro lie lie Cys Ala Thr Gin Met Leu Glu Ser Met lie Lys Lys 

325 330 335 
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Pro Arg Pro Thr Arg Ala Glu Gly Ser Asp Val Ala Asn Ala Val Leu 
340 345 350 

5 Asp Gly Ala Asp Cys lie Met Leu Ser Gly Glu Thr Ala Lys Gly Asp 
355 ' 360 365 

Tyr Pro Leu Glu Ala Val Arg Met Gin His Ala lie Ala Arg Glu Ala 
370 375 380 

10 

Glu Ala Ala Met Phe His Arg Gin Gin Phe Glu Glu He Leu Arg His 
385 390 395 400 

Ser Val His His Arg Glu Pro Ala Asp Ala Met Ala Ala Gly Ala Val 
15 405 410 415 

Glu Ala Ser Phe Lys Cys Leu Ala Ala Ala Leu He Val Met Thr Glu 
420 425 430 

20 Ser Gly Arg Ser Ala His Leu Val Ser Arg Tyr Arg Pro Arg Ala Pro 
435 440 445 

He He Ala Val Thr Arg Asn Asp Gin Thr Ala Arg Gin Ala His Leu 
450 455 460 

25 

Tyr Arg Gly Val Phe Pro Val Leu Cys Lys Gin Pro Ala His Asp Ala 
465 470 475 480 

Trp Ala Glu Asp Val Asp Leu Arg Val Asn Leu Gly Met Asn Val Gly 
30 485 490 495 

Lys Ala Arg Gly Phe Phe Lys Thr Gly Asp Leu Val lie Val Leu Thr 
500 505 510 

35 Gly Trp Arg Pro Gly Ser Gly Tyr Thr Asn Thr Met Arg Val Val Pro 
515 520 525 

Val Pro 
530 

40 

A DNA molecule encoding the full length CMPK has a nucleotide sequence according 
to SEQ ID No: 18 as follows: 



atgtcgaagc accacgatgc agggaccgct ttcatccaga cccagcagct gcacgctgcc 60 

45 atggcagaca cctttctgga gcacatgtgc cgcctggaca tcgactccga gccaaccatt 120 

gccagaaaca ccggcatcat ctgcaccatc ggcccagcct cccgctctgt ggacaagctg 180 

aaggaaatga ttaaatctgg aatgaatgtt gcccgcctca acttctcgca cggcacccac 240 

gagtatcatg agggcacaat taagaacgtg cgagaggcca cagagagctt tgcctctgac 300 

ccgatcacct acagacctgt ggctattgca ctggacacca agggacctga aatccgaact 360 

50 ggactcatca agggaagtgg cacagcagag gtggagctca agaagggcgc agctctcaaa 42 0 

gtgacgctgg acaatgcctt catggagaac tgcgatgaga atgtgctgtg ggtggactac 480 

aagaacctca tcaaagttat agatgtgggc agcaaaatct atgtggatga cggtctcatt 540 

tccttgctgg ttaaggagaa aggcaaggac tttgtcatga ctgaggttga gaacggtggc 600 

atgcttggta gtaagaaggg agtgaacctc ccaggtgctg cggtcgacct gcctgcagtc 660 

55 tcagagaagg acattcagga cctgaaattt ggcgtggagc agaatgtgga catggtgttc 720 

gcttccttca tccgcaaagc tgctgatgtc catgctgtca ggaaggtgct aggggaaaag 780 

ggaaagcaca tcaagattat cagcaagatt gagaatcacg agggtgtgcg caggtttgat 840 

gagatcatgg aggccagcga tggcattatg gtggcccgtg gtgacctggg Cattgagatc 900 

cctgctgaaa aagtcttcct cgcacagaag atgatgattg ggcgctgcaa cagggctggc 960 

60 aaacccatca tttgtgccac tcagatgttg gaaagcatga tcaagaaacc tcgcccgacc 1020 

cgcgctgagg gcagtgatgt tgccaatgca gttctggatg gagcagactg catcatgctg 1080 
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tctggggaga ccgccaaggg agactaccca ctggaggctg 
gctcgtgagg ctgaggccgc aatgttccat cgtcagcagt 
agtgtacacc acagggagcc tgctgatgcc atggcagcag 
aagtgcttag cagcagctct gatagttatg accgagtctg 
5 tcccggtacc gcccgcgggc tcccatcatc gccgtcaccc 
caggcacacc tgtaccgcgg cgtcttcccc gtgctgtgca 
tgggcagagg atgtggatct ccgtgtgaac ctgggcatga 
ttcttcaaga ccggggacct ggtgatcgtg ctgacgggct 
accaacacca tgcgggtggt gcccgtgcca tga 

10 

The amino acid sequence and nucleotide sequence for the full length CMPK is reported 
at Genbank Accession Nos. AAA49021 and J00903, respectively, each of which is 
hereby incorporated by reference in its entirety. 

Fragments of CMPK which afford cytoplasmic retention of the first 
1 5 chimeric protein include, without limitation, polypeptides containing at a minimum 
residues 1-479 of SEQ ID No: 18. 

The polypeptide that includes a plurality of histidine residues preferably 
contains a sufficient number of histidine residues so as to allow the first chimeric 
protein containing such histidine residues to be bound by an antibody which recognizes 
20 the plurality of histidine residues. One type of DNA molecule encoding H n is (cac) m 
where n is greater than 1, but preferably greater than about 5. This His region can be 
used during immuno-purification, which is described in greater detail below. 

The polypeptide that includes an epitope tag can be any epitope tag that 
is recognized with antibodies raised against the epitope tag. An exemplary epitope tag 
25 is a hemagglutinin ("HA") domain. The HA domain is present only when it is desirable 
to examine, i.e., in vitro, localization of the first chimeric protein within cells that have 
translocated it. One suitable HA domain has an amino acid sequence according to 
SEQ ID No: 19 as follows: 

30 Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 
1 5 

This HA sequence is encoded by a DNA molecule having a nucleotide sequence 
according to SEQ ID No: 20 as follows: 

35 

tacccctacg acgtgcccga ctacgcc 27 



tgcgcatgca gcacgctatt 1140 
ttgaagaaat cttacgccac 1200 
gcgcggtgga ggcctccttt 1260 
gcaggtctgc acacctggtg 1320 
gcaatgacca aacagcacgc 1380 
agcagccggc ccacgatgcc 1440 
atgtcggcaa agcccgtgga 1500 
ggcgccccgg ctccggctac 1560 

1593 
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An exemplary first chimeric protein of the present invention which is 
suitable for use in humans, designated TAT-hAPOBEC-CMPK, is set forth in Figure 
1 A. This first chimeric protein (human) includes: an N-terminal HIV tat protein 
transduction domain, a hemagglutinin domain, a polypeptide fragment of human 
5 APOBEC-1 , a CMPK domain, and a C-terminal His tag. The amino acid sequence 
(SEQ ID No: 2) and encoding nucleotide sequence (SEQ ID No: 1) of this exemplary 
first chimeric protein (human) is set forth in Figures ID and 1B-C, respectively. 

An exemplary first chimeric protein of the present invention which is 
suitable for use in rats, designated TAT-rAPOBEC-CMPK, is set forth in Figure 2A. 

10 This first chimeric protein (rat) includes: an N-terminal HTV tat protein transduction 
domain, a hemagglutinin domain, a polypeptide fragment of rat APOBEC- 1 , a CMPK 
domain, and a C-terminal His tag. The amino acid sequence (SEQ ID No: 4) and 
encoding nucleotide sequence (SEQ ID No: 3) of this exemplary first chimeric protein 
(rat) is set forth in Figures 2D and 2B-C, respectively. 

1 5 According to a second aspect of the present invention, a second 

chimeric protein is provided for use in combination with the first chimeric protein 
described above. The second chimeric protein includes a first polypeptide that includes 
a protein transduction domain and a second polypeptide the includes ACF or a 
fragment thereof which can bind to apolipoprotein B mRNA. 

20 The first polypeptide of the second chimeric protein can be a protein 

transduction domain of the type described above. The protein transduction domain of 
the second chimeric protein can be the same or different from the protein transduction 
domain of the first chimeric protein. 

The second polypeptide of the second chimeric protein, as noted above, 

25 includes ACF or a fragment thereof which can bind to apolipoprotein B mRNA. 

Although it has been proposed that a number of different proteins assist APOBEC- 1 in 
editing apolipoprotein B mRNA, ACF has been identified as the minimal protein 
complement for editing in vitro in the human system (Mehta et al., Molecular cloning 
of apobec-1 complementation factor, a novel RNA binding protein involved in the 

30 editing of apo B mRNA," Mol. Cell. Biol 20: 1846-1854 (2000), which is hereby 
incorporated by reference in its entirety). In accordance with the present invention, 
therefore, the second chimeric protein binds apolipoprotein B mRNA at the mooring 
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sequence and through its interactions with the first chimeric protein, sequesters the 
first chimeric protein to the cytidine of the apolipoprotein B mRNA to be edited (i.e., 
at position 6666), thereby resulting in its conversion to a uridine. As noted above, this 
conversion results in a stop codon that contributes to expression of the apolipoprotein 
5 B48 derivative. 

Recent studies have suggested that APOBEC-1 requires a chaperone 
for its nuclear localization (Yang et al, "Intracellular trafficking determinants in 
APOBEC-1, the catalytic subunit for cytidine to uridine editing of apolipoprotein B 
mRNA/' Exp. Cell Res. 267: 153-164 (2001), which is hereby incorporated by 
10 reference in its entirety). More recently, however, it has been learned that APOBEC-1 
is most likely associated with ACF throughout the cell and, therefore, it may import to 
the nucleus as an APOBEC-1 /ACF complex. A bipartite nuclear localization signal is 
predicted in ACF (see below). 

ACF is expressed at sufficient levels within the hepatic cells of rat 
15 (Dance et al., "Two proteins essential for apolipoprotein B mRNA editing are 
expressed from a single gene through alternative splicing," J. Biol. Cheiru 
electronically published as manuscript Ml 1 1337200 (2002), which is hereby 
incorporated by reference in its entirety), such that augmenting of the intracellular ACF 
concentration is not needed. However, to optimize apolipoprotein B mRNA editing, in 
20 some instances it may be desirable to increase the intracellular concentration of ACF. 

The full length rat ACF has an amino acid sequence according to SEQ 
ID No: 21 as follows: 

Met Glu Ser Asn His Lys Ser Gly Asp Gly Leu Ser Gly Thr Gin Lys 
25 1 5 10 15 

Glu Ala Ala Leu Arg Ala Leu Val Gin Arg Thr Gly Tyr Ser Leu Val 
20 25 30 

30 Gin Glu Asn Gly Gin Arg Lys Tyr Gly Gly Pro Pro Pro Gly Trp Asp 
35 40 45 

Thr Thr Pro Pro Glu Arg Gly Cys Glu He Phe He Gly Lys Leu Pro 
50 55 60 

35 

Arg Asp Leu Phe Glu Asp Glu Leu He Pro Leu Cys Glu Lys He Gly 
65 70 75 * 80 

Lys He Tyr Glu Met Arg Met Met Met Asp Phe Asn Gly Asn Asn Arg 
40 85 90 95 
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Gly Tyr Ala Phe Val Thr Phe Ser Asn Lys Gin Glu Ala Lys Asn Ala 
100 105 110 

lie Lys Gin Leu Asn Asn Tyr Glu lie Arg Asn Gly Arg Leu Leu Gly 
115 120 125 

Val Cys Ala Ser Val Asp Asn Cys Arg Leu Phe Val Gly Gly He Pro 
130 135 140 

Lys Thr Lys Lys Arg Glu Glu He Leu Ser Glu Mec Lys Lys Val Thr 
145 150 155 160 



Glu Gly Val Val Asp Val He Val Tyr Pro Ser Ala Ala Asp Lys Thr 
15 165 170 175 

Lys Asn Arg Gly Phe Ala Phe Val Glu Tyr Glu Ser His Arg Ala Ala 
180 185 190 

20 Ala Met Ala Arg Arg Arg Leu Leu Pro Gly Arg He Gin Leu Trp Gly 
195 200 205 



25 



His Pro lie Ala Val Asp Trp Ala Glu Pro Glu Val Glu Val Asp Glu 
210 215 220 

Asp Thr Met Ser Ser Val Lys lie Leu Tyr Val Arg Asn Leu Met Leu 
225 230 235 240 



Ser Thr Ser Glu Glu Met lie Glu Lys Glu Phe Asn Ser lie Lys Pro 
30 245 250 255 

Gly Ala Val Glu Arg Val Lys Lys lie Arg Asp Tyr Ala Phe Val His 

260 265 270 

35 Phe Ser Asn Arg Glu Asp Ala Val Glu Ala Met Lys Ala Leu Asn Gly 
275 280 285 



40 



Lys Val Leu Asp Gly Ser Pro lie Glu Val Thr Leu Ala Lys Pro Val 
290 295 300 

Asp Lys Asp Ser Tyr Val Arg Tyr Thr Arg Gly Thr Gly Gly Arg Asn 
305 310 315 320 



Thr Met Leu Gin Glu Tyr Thr Tyr Pro Leu Ser His Val Tyr Asp Pro 

45 325 330 335 

Thr Thr Thr Tyr Leu Gly Ala Pro Val Phe Tyr Thr Pro Gin Ala Tyr 

340 345 350 

50 Ala Ala lie Pro Ser Leu His Phe Pro Ala Thr Lys Gly His Leu Ser 

355 360 365 



55 



Asn Arg Ala Leu He Arg Thr Pro Ser Val Arg Glu lie Tyr Met Asn 

370 375 380 

Val Pro Val Gly Ala Ala Gly Val Arg Gly Leu Gly Gly Arg Gly Tyr 

385 390 395 400 



Leu Ala Tyr Thr Gly Leu Gly Arg Gly Tyr Gin Val Lys Gly Asp Lys 
60 405 410 415 



Arg Gin Asp Lys Leu Tyr Asp Leu Leu Pro Gly Met Glu Leu Thr Pro 
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420 425 430 

Met Asn Thr lie Ser Leu Lys Pro Gin Gly Val Lys Leu Ala Pro Gin 
435 440 445 

5 

He Leu Glu Glu He Cys Gin Lys Asn Asn Trp Gly Gin Pro Val Tyr 
450 455 460 

Gin Leu His Ser Ala He Gly Gin Asp Gin Arg Gin Leu Phe Leu Tyr 
0 465 470 475 480 

Lys Val Thr He Pro Ala Leu Ala Ser Gin Asn Pro Ala He His Pro 
485 490 495 

5 Phe Thr Pro Pro Lys Leu Ser Ala Tyr Val Asp Glu Ala Lys Arg Tyr 
500 505 510 



:0 



5 



Ala Ala Glu His Thr Leu Gin Thr Leu Gly lie Pro Thr Glu Gly Gly 
515 520 525 

Asp Ala Gly Thr Thr Ala Pro Thr Ala Thr Ser Ala Thr Val Phe Pro 
530 535 540 



Gly Tyr Ala Val Pro Ser Ala Thr Ala Pro Val Ser Thr Ala Gin Leu 
:5 545 550 555 560 

Lys Gin Ala Val Thr Leu Gly Gin Asp Leu Ala Ala Tyr Thr Thr Tyr 
565 570 575 

0 Glu Val Tyr Pro Thr Phe Ala Val Thr Thr Arg Gly Asp Gly Tyr Gly 
580 585 590 



Thr Phe 



A DNA molecule encoding the full length rat ACF has a nucleotide sequence 
according to SEQ ID No: 22 as follows: 



atggaatcaa 
o cgcgcactgg 
ggtggtcctc 
gggaaacttc 
aaaatttatg 
gtaaccttct 
5 attcggaatg 
gggggaatcc 
gaaggagttg 
tttgcctttg 
ccaggaagaa 
o gaagttgacg 
tctacctcgg 
cgggtgaaga 
gaagccatga 
gccaagccag 
accatgctgc 
cttggagctc 
ccagctacca 
atttacatga 
ttggcatata 
ctctatgacc 
caaggagtta 



atcacaaatc 
tccagcgcac 
caccaggctg 
cccgggacct 
aaatgagaat 
caaataagca 
gccgtctcct 
ccaaaaccaa 
ttgatgtcat 
tggaatatga 
ttcagttgtg 
aagacacaat 
aagagatgat 
agatccgaga 
aggctttgaa 
tggacaagga 
aagaatacac 
ctgtcttcta 
aaggacatct 
atgtccctgt 
caggcctggg 
ttctgcctgg 
aacttgctcc 



cggggatgga 
aggatatagc 
ggatactaca 
ttttgaggat 
gatgatggat 
ggaagccaag 
gggcgtctgt 
aaagagagaa 
tgtctaccca 
gagtcaccgc 
gggacatcct 
gtcttccgtg 
tgagaaggaa 
ctatgctttt 
tggcaaggtg 
cagttacgtt 
ctaccctctg 
tactccccaa 
cagcaacaga 
aggggctgcg 
tcgaggatac 
gatggagctc 
tcagatatta 



ttgagcggca 
ttggtccagg 
cccccagaaa 
gaactcatac 
ttcaatggga 
aatgcaatca 
gccagtgtgg 
gaaatcttgt 
agcgctgccg 
gcagccgcca 
atcgcagtag 
aaaatcctgt 
ttcaacagta 
gtgcatttca 
ctggatggtt 
aggtacaccc 
agccatgttt 
gcctacgcag 
gctctcatcc 
ggcgtgagag 
caggtcaaag 
accccgatga 
gaagaaatct 



cccagaagga 
aaaatggaca 
ggggctgcga 
cattgtgtga 
acaacagagg 
agcaacttaa 
acaactgccg 
cagagatgaa 
ataaaaccaa 
tggctaggcg 
actgggcaga 
acgtaaggaa 
ttaaaccagg 
gtaaccgaga 
ccccaataga 
ggggcaccgg 
atgaccctac 
ccattccaag 
ggaccccttc 
gactgggcgg 
gagacaagag 
atactatctc 
gtcagaaaaa 



agcagcactc 
aagaaaatat 
gattttcatt 
aaaaattggt 
ctatgcattt 
taattatgaa 
gttgtttgtg 
aaaggtcact 
aaaccggggg 
gaggctgctg 
gccagaagtc 
ccttatgctg 
tgctgtggaa 
agatgcagtt 
agtgaccttg 
gggcaggaac 
cacaacctac 
tcttcatttc 
tgtcagagaa 
ccgtgggtat 
acaagacaaa 
tttaaaacca 
taactgggga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 
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cagccagtgt accagctgca ctctgccatt ggacaagacc aaagacagtt attcctatac 1440 
aaagtaacta tcccagcgct ggccagccag aatcctgcga tccacccttc cacaccccca 1500 
aagctaagcg cctacgtgga tgaagcaaag aggtacgccg cagagcacac cctacagaca 1560 
ctaggcatcc ccacagaagg aggggacgct gggactacag cacccactgc cacatccgcc 1620 
5 actgtgtttc caggatacgc tgtccccagt gccaccgctc ctgtgtctac agcccagctc 1680 
aagcaagcag tgacacttgg acaagactta gcagcatata caacctatga ggtctaccct 1740 
acttttgcag tgaccacccg aggtgatgga tatggcacct tctga 1785 

The amino acid sequence and nucleotide sequence for the full length rat ACF65 is 
10 reported at Genbank Accession Nos. AAK50145 and AY028945, respectively, each of 

which is hereby incorporated by reference in its entirety. In addition, it should be 

noted that a short isoform of rat ACF64 exists, as identified at Genbank Accession No. 

AF290984, which is hereby incorporated by reference in its entirety. 

The full length human ACF has an amino acid sequence according to 
15 SEQ ID No: 23 as follows: 



Met Glu Ser Asn His Lys Ser Gly Asp Gly Leu Ser Gly Thr Gin Lys 
15 10 15 

20 Glu Ala Ala Leu Arg Ala Leu Val Gin Arg Thr Gly Tyr Ser Leu Val 
20 25 30 



25 



40 



Gin Glu Asn Gly Gin Arg Lys Tyr Gly Gly Pro Pro Pro Gly Trp Asp 
35 40 45 

Ala Ala Pro Pro Glu Arg Gly Cys Glu lie Phe lie Gly Lys Leu Pro 
50 55 60 



Arg Asp Leu Phe Glu Asp Glu Leu lie Pro Leu Cys Glu Lys He Gly 
30 65 70 75 80 

Lys He Tyr Glu Met Arg Met Met Met Asp Phe Asn Gly Asn Asn Arg 
85 90 95 

35 Gly Tyr Ala Phe Val Thr Phe Ser Asn Lys Val Glu Ala Lys Asn Ala 
100 105 110 



He Lys Gin Leu Asn Asn Tyr Glu He Arg Asn Gly Arg Leu Leu Gly 
115 120 125 

Val Cys Ala Ser Val Asp Asn Cys Arg Leu Phe Val Gly Gly He Pro 
130 135 140 



Lys Thr Lys Lys Arg Glu Glu He Leu Ser Glu Met Lys Lys Val Thr 
45 145 150 155 160 

Glu Gly Val Val Asp Val He Val Tyr Pro Ser Ala Ala Asp Lys Thr 
165 170 175 

50 Lys Asn Arg Gly Phe Ala Phe Val Glu Tyr Glu Ser His Arg Ala Ala 
180 185 190 

Ala Met Ala Arg Arg Lys Leu Leu Pro Gly Arg He Gin Leu Trp Gly 
195 200 205 
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His Gly He Ala Val Asp Trp Ala Glu Pro Glu Val Glu Val Asp Glu 

210 215 220 

Asp Thr Met Ser Ser Val Lys He Leu Tyr Val Arg Asn Leu Met Leu 

225 230 235 " 240 

Ser Thr Ser Glu Glu Met He Glu Lys Glu Phe Asn Asn He Lys Pro 

245 250 255 

Gly Ala Val Glu Arg Val Lys Lys He Arg Asp Tyr Ala Phe Val His 

260 265 270 



Phe Ser Asn Arg Lys Asp Ala Val Glu Ala Met Lys Ala Leu Asn Gly 

15 275 ~ 280 285 

Lys Val Leu Asp Gly Ser Pro He Glu Val Thr Leu Ala Lys Pro Val 

290 295 300 

20 Asp Lys Asp Ser Tyr Val Arg Tyr Thr Arg Gly Thr Gly Gly Arg Gly 

305 310 315 320 



25 



Thr Met Leu Gin Gly Glu Tyr Thr Tyr Ser Leu Gly Gin Val Tyr Asp 
325 330 335 

Pro Thr Thr Thr Tyr Leu Gly Ala Pro Val Phe Tyr Ala Pro Gin Thr 
340 345 350 



Tyr Ala Ala He Pro Ser Leu His Phe Pro Ala Thr Lys Gly His Leu 
30 355 360 365 

Ser Asn Arg Ala He lie Arg Ala Pro Ser Val Arg Gly Ala Ala Gly 
370 375 380 

35 Val Arg Gly Leu Gly Gly Arg Gly Tyr Leu Ala Tyr Thr Gly Leu Gly 
385 390 395 400 



40 



Arg Gly Tyr Gin Val Lys Gly Asp Lys Arg Glu Asp Lys Leu Tyr Asp 
405 410 415 

lie Leu Pro Gly Met Glu Leu Thr Pro Met Asn Pro Val Thr Leu Lys 
420 425 430 



Pro Gin Gly lie Lys Leu Ala Pro Gin lie Leu Glu Glu He Cys Gin 
45 435 440 445 



50 



Lys Asn Asn Trp Gly Gin Pro Val Tyr Gin Leu His Ser Ala lie Gly 
450 455 460 

Gin Asp Gin Arg Gin Leu Phe Leu Tyr Lys lie Thr He Pro Ala Leu 
465 470 475 480 



55 



Ala Ser Gin Asn Pro Ala lie His Pro Phe Thr Pro Pro Lys Leu Ser 
485 490 495 

Ala Phe Val Asp Glu Ala Lys Thr Tyr Ala Ala Glu Tyr Thr Leu Gin 
500 505 510 



Thr Leu Gly lie Pro Thr Asp Gly Gly Asp Gly Thr Met Ala Thr Ala 
60 515 520 525 



Ala Ala Ala Ala Thr Ala Phe Pro Gly Tyr Ala Val Pro Asn Ala Thr 
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530 535 540 

Ala Pro Val Ser Ala Ala Gin Leu Lys Gin Ala Val Thr Leu Gly Gin 
545 550 555 560 

5 

Asp Leu Ala Ala Tyr Thr Thr Tyr Glu Val Tyr Pro Thr Phe Ala Val 
565 " 570 575 

Thr Ala Arg Gly Asp Gly Tyr Gly Thr Phe 
10 580 585 



A DNA molecule encoding the full length human ACF has a nucleotide sequence 
according to SEQ ID No: 24 as follows: 



15 atggaatcaa atcacaaatc cggggatgga 
cgcgcactgg tccagcgcac aggatatagc 
ggtggccctc cacctggttg ggatgctgca 
ggaaaacttc cccgagacct ttttgaggat 
aaaatttatg aaatgagaat gatgatggat 

20 gtaacatttt caaataaagt ggaagccaag 
attagaaatg ggcgcctctt aggggtttgt 
9999gcatcc caaaaaccaa aaagagagaa 
gaaggtgttg tcgatgtcat cgtctaccca 
tttgccttcg tggagtatga gagtcatcga 

25 ccaggaagaa ttcagttatg gggacatggt 
gaagttgatg aagatacaat gtcttcagtg 
tctacctctg aagagatgat tgaaaaggaa 
agggtgaaga aaattcgaga ctatgctttt 
gaggctatga aagctttaaa tggcaaggtg 

30 gcaaaaccag tggacaagga cagttatgtt 
accatgctgc aaggagagta tacctactct 
taccttggag ctcctgtctt ctatgccccc 
ttcccagcca ccaaaggaca tctcagcaac 
99ggctgcgg gagtgagagg actgggcggc 

35 cgaggatacc aggtcaaagg agacaaaaga 
atggagctca ccccaatgaa tcctgtcaca 
cagatattag aagagatttg tcagaaaaat 
tctgctattg gacaagacca aagacagcta 
gccagccaga atcctgcaat ccaccctttc 

40 gaagcaaaga cgtatgcagc cgaatacacc 
ggcgatggca ccatggctac tgctgctgct 
cctaatgcaa ctgcacccgt gtctgcagcc 
gacttagcag catatacaac ctatgaggtc 
gatggatatg gcaccttctg a 

45 



ttgagcggca ctcagaagga agcagccctc 60 
ttggtccagg aaaatggaca aagaaaatat 120 
ccccctgaaa ggggctgtga aatttttatt 180 
gagcttatac cattatgtga aaaaatcggt 240 
tttaatggca acaatagagg atatgcattt 300 
aatgcaatca agcaacttaa taattatgaa 360 
gccagtgtgg acaactgccg attatttgtt 420 
gaaatcttat cggagatgaa aaaggttact 480 
agcgctgcag ataaaaccaa aaaccgaggc 54 0 
gcagctgcca tggcgaggag gaaactgcta 600 
attgcagtag actgggcaga gccagaagta 660 
aaaatcctat atgtaagaaa tcttatgctg 720 
ttcaacaata tcaaaccagg tgctgtggag 780 
gtgcacttca gtaaccgaaa agatgcagtt 840 
ctggatggtt cccccattga agtcacccta 900 
aggtataccc gaggcacagg tggaaggggc 960 
ttgggccaag tttatgatcc caccacaacc 1020 
cagacctatg cagcaattcc cagtcttcat 1080 
agagccatta tccgagcccc ttctgttaga 1140 
cgtggctatt tggcatacac aggcctgggt 1200 
gaagacaaac tctatgacat tttacctggg 1260 
ttaaaacccc aaggaattaa actcgctccc 1320 
aactggggac agccagtgta ccagctgcac 1380 
ttcttgtaca aaataactat tcctgctcta 1440 
acacctccaa agctgagtgc ctttgtggat 1500 
ctgcagaccc tgggcatccc cactgatgga 1560 
gctgctactg ctttcccagg atatgctgtc 1620 
cagctcaagc aagcggtaac ccttggacaa 1680 
tacccaactt ttgcagtgac tgcccgaggg 1740 

1761 



The amino acid sequence and nucleotide sequence for the full length human ACF is 
reported at Genbank Accession Nos. AAF76221 and AF271789, respectively, each of 
which is hereby incorporated by reference in its entirety. 

In comparing the human and rat ACF homologs, it is apparent that 
50 these proteins share 93.5 percent identity at the amino acid level and, moreover, 

antibodies raised against the human ACF also recognize rat ACF. Is has been reported 
that functional complementation of apolipoprotein B mRNA editing by APOBEC-1 
involves the N-terminal 380 residues of ACF (Blanc et al., "Mutagenesis of Apobec-1 
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complementation factor reveals distinct domains that modulate RNA binding, protein- 
protein interaction with Apobec-1, and complementation of C to U RNA-editing 
activity/* J. Biol. Chem. 276(49): 46386-46393 (2001), which is hereby incorporated 
by reference in its entirety). 
5 The second chimeric protein of the present invention can also include 

one or more other polypeptide sequences, including without limitation: (i) a 
polypeptide that includes a cytoplasmic localization protein or a fragment thereof 
which, upon cellular uptake of the second chimeric protein, localizes the second 
chimeric protein to the cytoplasm, (ii) a polypeptide that includes a plurality of 
10 adjacent histidine residues; and (iii) a polypeptide that includes a hemagglutinin 
domain. Each of these has been described above with respect to the first chimeric 
protein. 

An exemplary second chimeric protein of the present invention which is 
suitable for use in humans, designated TAT-hACF, is set forth in Figure 3A. This 
1 5 second chimeric protein (human) includes: an N-terminal HIV tat protein transduction 
domain, a hemagglutinin domain, a polypeptide fragment of human ACF, and a C- 
terminal His tag. The amino acid sequence (SEQ ID No: 6) and encoding nucleotide 
sequence (SEQ ID No: 5) of this exemplary second chimeric protein (human) is set 
forth in Figures 3B-C. 

20 An exemplary second chimeric protein of the present invention which is 

suitable for use in rats, designated TAT-rACF, is set forth in Figure 4A. This second 
chimeric protein (rat) includes: an N-terminal HIV tat protein transduction domain, a 
hemagglutinin domain, a polypeptide fragment of rat ACF, and a C-terminal His tag. 
The amino acid sequence (SEQ ID No: 8) and encoding nucleotide sequence (SEQ ID 

25 No: 7) of this exemplary second chimeric protein (rat) is set forth in Figures 4B-C. 

DNA molecules encoding the above-identified first and second chimeric 
proteins can be assembled using conventional molecular genetic manipulation for 
subcloning gene fragments, such as described by Sambrook et al, Molecular Cloning: 
A Laboratory Manual Cold Springs Laboratory, Cold Springs Harbor, New York 

30 (1989), and Ausubel et al. (ed.), Current Protocols in Molecular Biology . John Wiley 
& Sons (New York, NY) (1999 and preceding editions), each of which is hereby 
incorporated by reference in its entirety. In conjunction therewith, desired fragments 



WO 02/068676 



PCT/US02/05824 



-29- 

of the APOBEC-1, ACF, or CMPK encoding DNA molecules can be obtained using 
the PCR technique together with specific sets of primers chosen to represent particular 
portions of the protein. Erlich et al., Science 252: 1643-51 (1991), which is hereby 
incorporated by reference in its entirety. 
5 Once the desired DNA molecules have been assembled, DNA 

constructs can be assembled by ligating together the DNA molecule encoding the first 
or second chimeric protein with appropriate regulatory sequences including, without 
limitation, a promoter sequence operably connected 5' to the DNA molecule, a 3* 
regulatory sequence operably connected 3' of the DNA molecule, as well as any 

10 enhancer elements, suppressor elements, etc. The DNA construct can then be inserted 
into an appropriate expression vector. Thereafter, the vector can be used to transform 
a host cell, typically although not exclusively a prokaryote, and the recombinant host 
cell can express the first or second chimeric protein of the present invention. 

When a prokaryotic host cell is selected for subsequent transformation, 

15 the promoter region used to construct the DNA construct (i.e., transgene) should be 
appropriate for the particular host. The DNA sequences of eukaryotic promoters, as 
described infra for expression in eukaryotic host cells, differ from those of prokaryotic 
promoters. Eukaryotic promoters and accompanying genetic signals may not be 
recognized in or may not function in a prokaryotic system and, further, prokaryotic 

20 promoters are not recognized and do not function in eukaryotic cells. 

Similarly, translation of mRNA in prokaryotes depends upon the 
presence of the proper prokaryotic signals which differ from those of eukaryotes. 
Efficient translation of mRNA in prokaryotes requires a ribosome binding site called 
the Shine-Dalgarno ("SD") sequence on the mRNA. This sequence is a short 

25 nucleotide sequence of mRNA that is located before the start codon, usually AUG, 
which encodes the amino-terminal methionine of the protein. The SD sequences are 
complementary to the 3'-end of the 16S rRNA (ribosomal RNA) and probably 
promote binding of mRNA to ribosomes by duplexing with the rRNA to allow correct 
positioning of the ribosome. For a review on maximizing gene expression, see Roberts 

30 and Lauer, Methods in Enzvmology . 68:473 (1 979), which is hereby incorporated by 
reference in its entirety. 
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Promoters vary in their "strength" (i.e., their ability to promote 
transcription). For the purposes of expressing a cloned gene, it is desirable to use 
strong promoters in order to obtain a high level of transcription and, hence, expression 
of the gene. Depending upon the host cell system utilized, any one of a number of 
5 suitable promoters may be used. For instance, when cloning in E. coli, its 

bacteriophages, or plasmids, promoters such as the T7 phage promoter, lac promoter, 
trp promoter, recA promoter, ribosomal RNA promoter, the P R and P L promoters of 
coliphage lambda and others, including but not limited, to lacUV5, ompF, bla, Ipp, and 
the like, may be used to direct high levels of transcription of adjacent DNA segments. 

10 Additionally, a hybrid trp4ac\JV5 (tac) promoter or other E. coli promoters produced 
by recombinant DNA or other synthetic DNA techniques may be used to provide for 
transcription of the inserted gene. 

Bacterial host cell strains and expression vectors may be chosen which 
inhibit the action of the promoter unless specifically induced. In certain operons, the 

1 5 addition of specific inducers is necessary for efficient transcription of the inserted 
DNA. For example, the lac operon is induced by the addition of lactose or IPTG 
(isopropylthio-beta-D-galactoside). A variety of other operons, such as trp, pro, etc., 
are under different controls. 

Specific initiation signals are also required for efficient gene 

20 transcription and translation in prokaryotic cells. These transcription and translation 
initiation signals may vary in "strength" as measured by the quantity of gene specific 
messenger RNA and protein synthesized, respectively. The DNA expression vector, 
which contains a promoter, may also contain any combination of various "strong" 
transcription and/or translation initiation signals. For instance, efficient translation in 

25 E. coli requires a Shine-Dalgarno ("SD") sequence about 7-9 bases 5' to the initiation 
codon ("ATG") to provide a ribosome binding site. Thus, any SD-ATG combination 
that can be utilized by host cell ribosomes may be employed. Such combinations 
include, but are not limited to, the SD-ATG combination from the cro gene or the /V 
gene of coliphage lambda, or from the E. coli tryptophan E, D, C, B or A genes. 

30 Additionally, any SD-ATG combination produced by recombinant DNA or other 
techniques involving incorporation of synthetic nucleotides may be used. 
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Mammalian cells can also be used to recombinantly produce the first or 
second chimeric proteins of the present invention. Suitable mammalian host cells 
include, without limitation: COS (e.g., ATCC No. CRL 1650 or 1651), BHK (e.g., 
ATCC No. CRL 6281), CHO (ATCC No. CCL 61), HeLa (e.g., ATCC No. CCL 2), 
5 293 (ATCC No. 1573), CHOP, and NS-1 cells. Suitable expression vectors for 

directing expression in mammalian cells generally include a promoter, as well as other 
transcription and translation control sequences known in the art. Common promoters 
include, without limitation, SV40, MMTV, metallothionein-l, adenovirus Ela, CMV, 
immediate early, immunoglobulin heavy chain promoter and enhancer, and RSV-LTR. 

10 Regardless of the selection of host cell, once the DNA molecule coding 

for a first or second chimeric protein has been Iigated to its appropriate regulatory 
regions using well known molecular cloning techniques, it can then be introduced into 
a suitable vector or otherwise introduced directly into a host cell using transformation 
protocols well known in the art (Sambrook et al., Molecular Cloning: A Laboratory 

15 Manual , Second Edition, Cold Spring Harbor Press, NY (1989), which is hereby 
incorporated by reference in its entirety). 

The recombinant DNA molecule can be introduced into host cells via 
transformation, particularly transduction, conjugation, mobilization, or electroporation. 
Suitable host cells include, but are not limited to, bacteria, virus, yeast, mammalian 

20 cells, insect, plant, and the like. The host cells, when grown in an appropriate medium, 
are capable of expressing the chimeric protein, which can then be isolated therefrom 
and, if necessary, purified. The first or second chimeric protein is preferably produced 
in purified form (preferably at least about 80%, more preferably 90%, pure) by 
conventional techniques, including immuno-purification techniques. Immuno-isolation 

25 followed by metal-chelating affinity chromatography and cation ic exchange 
chromatography is described in Example 1 infra. 

A further aspect of the present invention relates to a number of 
compositions, preferably pharmaceutical compositions, which include the first and/or 
second chimeric protein of the present invention. 

30 According to one embodiment, a composition includes a 

pharmaceutically acceptable carrier and the first chimeric protein of the present 
invention. The first chimeric protein is preferably present in an amount which is 
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effective to modify apolipoprotein B mRNA editing in cells which uptake the first 
chimeric protein. 

According to a second embodiment, a composition includes the first 
and second chimeric proteins of the present invention. This composition can also 
5 include a pharmaceutical^ acceptable carrier in which the first and second chimeric 
proteins are dispersed. Preferably, the first chimeric protein is present in an amount 
which is effective to modify apolipoprotein B mRNA editing in cells which uptake the 
first chimeric protein and the second chimeric protein is present in an amount which is 
effective to bind apolipoprotein B mRNA and assist the first chimeric protein in 
10 modifying apolipoprotein B mRNA in cells which uptake the first and second chimeric 
proteins. 

The compositions of the present invention can also include suitable 
excipients, or stabilizers, and can be in solid or liquid form such as, tablets, capsules, 
powders, solutions, suspensions, or emulsions. Typically, the compositions will 

1 5 contain from about 0.01 to 99 percent, preferably from about 20 to 75 percent of the 
chimeric protein(s), together with the carrier, excipient, stabilizer, etc. 

The solid unit dosage forms can be of the conventional type. The solid 
form can be a capsule, such as an ordinary gelatin type containing the first and/or 
second chimeric protein(s) of the present invention and a carrier, for example, 

20 lubricants and inert fillers such as, lactose, sucrose, or cornstarch. In another 
embodiment, these first and/or second chimeric protein(s) are tableted with 
conventional tablet bases such as lactose, sucrose, or cornstarch in combination with 
binders like acacia, cornstarch, or gelatin, disintegrating agents, such as cornstarch, 
potato starch, or alginic acid, and a lubricant, like stearic acid or magnesium stearate. 

25 The first and/or second chimeric protein(s) of the present invention may 

also be administered in injectable or topically-applied dosages by solution or 
suspension of these materials in a physiologically acceptable diluent with a 
pharmaceutical carrier. Such carriers include sterile liquids, such as water and oils, 
with or without the addition of a surfactant and other pharmaceutically and 

30 physiologically acceptable carrier, including adjuvants, excipients or stabilizers. 
Illustrative oils are those of petroleum, animal, vegetable, or synthetic origin, for 
example, peanut oil, soybean oil, or mineral oil. In general, water, saline, aqueous 
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dextrose and related sugar solution, and glycols, such as propylene glycol or 
polyethylene glycol, are preferred liquid carriers, particularly for injectable solutions. 

For use as aerosols, the first and/or second chimeric protein(s) of the 
present invention in solution or suspension may be packaged in a pressurized aerosol 
5 container together with suitable propellants, for example, hydrocarbon propellants like 
propane, butane, or isobutane with conventional adjuvants. The compositions of the 
present invention also may be administered in a non-pressurized form such as in a 
nebulizer or atomizer. 

Depending upon the treatment being effected, the compounds of the 
10 present invention can be administered orally, topically, transdermally, parenterally, 
subcutaneously, intravenously, intramuscularly, intraperitoneal^, by intracavitary or 
intravesical instillation, intraocularly, intraarterially, intralesionally, or by application to 
mucous membranes, such as, that of the nose, throat, and bronchial tubes. In most 
instances, subcutaneous, intravenous, intramuscular, intraperitoneal, and intraarterial 
1 5 routes are preferred. 

Compositions within the scope of this invention include all 
compositions wherein the first and/or second chimeric proteins of the present invention 
is contained in an amount effective to achieve its intended purpose, noted above. 
While individual needs vary, determination of optimal ranges of effective amounts of 
20 each of the first and second chimeric proteins is within the skill of the art. Typical 
dosages comprise about 0.01 to about 100 mg/kgbody wt The preferred dosages 
comprise about 0.1 to about 100 mg/kgbody wt. The most preferred dosages 
comprise about 1 to about 100 mg/kg body wt. 

The amounts of the first and second chimeric proteins can be 
25 determined by one of ordinary skill in the art using routine testing to optimize the 

dosage levels of the first and second chimeric proteins in accordance with the desired 
degree of apolipoprotein B mRNA editing. Based on May 2001 guidelines by the 
National Institutes of Health's National Cholesterol Education Program (NCEP), 
individuals at low risk for a heart attack should have LDL levels under 160 mg/dL, 
30 while those at highest risk should aim for LDLs under 100 mg/dL. Treatment regimen 
for the administration of the first and/or second chimeric proteins of the present 
invention can also be determined readily by those with ordinary skill in art. 
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Typically, the first and/or second chimeric proteins (or compositions 
which contain one or both of the chimeric proteins of the present invention) can be 
administered via a drug delivery device which includes a chimeric protein or a 
composition of the present invention. Exemplary delivery devices include, without 
5 limitation, liposomes, niosomes, transdermal patches, implants, and syringes. 

Liposomes are vesicles comprised of one or more concentrically 
ordered lipid bilayers which encapsulate an aqueous phase. They are normally not 
leaky, but can become leaky if a hole or pore occurs in the membrane, if the membrane 
is dissolved or degrades, or if the membrane temperature is increased to the phase 

10 transition temperature. Current methods of drug delivery via liposomes require that 
the liposome carrier ultimately become permeable and release the encapsulated drug at 
the target site. This can be accomplished, for example, in a passive manner wherein 
the liposome bilayer degrades over time through the action of various agents in the 
body. Every liposome composition will have a characteristic half-life in the circulation 

15 or at other sites in the body and, thus, by controlling the half-life of the liposome 
composition, the rate at which the bilayer degrades can be somewhat regulated. 

In contrast to passive drug release, active drug release involves using an 
agent to induce a permeability change in the liposome vesicle. Liposome membranes 
can be constructed so that they become destabilized when the environment becomes 

20 acidic near the liposome membrane (see, e.g., Proc. Natl. Acad. Sci. USA 84:7851 
(1987); Biochemistry 28:908 (1989), which is hereby incorporated by reference in its 
entirety). When liposomes are endocytosed by a target cell, for example, they can be 
routed to acidic endosomes which will destabilize the liposome and result in drug 
release. 

25 Alternatively, the liposome membrane can be chemically modified such 

that an enzyme is placed as a coating on the membrane which slowly destabilizes the 
liposome. Since control of drug release depends on the concentration of enzyme 
initially placed in the membrane, there is no real effective way to modulate or alter 
drug release to achieve "on demand" drug delivery. The same problem exists for pH- 

30 sensitive liposomes in that as soon as the liposome vesicle comes into contact with a 
target cell, it will be engulfed and a drop in pH will lead to drug release. 



WO 02/068676 



PCT/US02/05824 



-35- 

This liposome delivery system can also be made to accumulate at a 
target organ, tissue, or cell via active targeting. In accordance with the present 
invention, liposomes can be targeted to liver cells by incorporating into the liposome 
bilayer a molecule which target hepatocyte receptors. One such molecule is the 
5 asialoglycoprotein asialofetuin, which targets the asialoglycoprotein receptor of 
hepatocytes. The incorporation of asialofetuin into the liposome bilayer can be 
performed according to the procedures set forth in Wu et al., "Increased liver uptake 
of liposomes and improved targeting efficacy by labeling with asialofetuin in rodents," 
Hepatologv 27(3):772-778 (1998), which is hereby incorporated by reference in its 
10 entirety. 

Niosomes are vesicles formed by amphiphilic materials. Non-ionic 
surfactants were the first materials studied (Iga et al., "Membrane modification by 
negatively charged stearylpolyoxyethylene derivatives for thermosensitive liposomes: 
Reduced liposomal aggregation and avoidance of reticuloendothelial system uptake," I 

1 5 Drug Target 2:259-67 ( 1 994), which is hereby incorporated by reference in its 

entirety) and a large number of surfactants have since been found to self assemble into 
closed bilayer vesicles (Ahl et al., "Enhancement of the in vivo circulation lifetime of 
L-alpha-distearoylphosphatidylcholine liposomes: Importance of liposomal aggregation 
versus complement opsonization," Biochim Biophys Acta 1329:370-82 (1997), which 

20 is hereby incorporated by reference in its entirety). These niosomal materials may be 
used for delivery of the first or second chimeric protein or for delivery of APOBEC-1 
or fragments thereof alone or in combination with ACF or fragments thereof. 

For example, 200nm doxorubicin niosomes with a polyoxyethylene 
(molecular weight 1,000) surface have been shown to be rapidly taken up by the liver 

25 (Uchegbu et al., "Distribution, metabolism and tumoricidal activity of doxorubicin 

administered in sorbitan monostearate (Span 60) niosomes in the mouse," Pharm. Res. 

12: 1019-24 (1995), which is hereby incorporated by reference in its entirety), allowing 

polymeric drug conjugates to be formed for delivery of the drug (see Duncan, "Drug 

* 

polymer conjugates — potential for improved chemotherapy," Anti-Cancer Drugs 
30 3:1 75-2 10(1 992), which is hereby incorporated by reference in its entirety). These 
techniques can be readily adapted for delivery of the first and second chimeric proteins 
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or, alternatively, APOBEC-1 or a fragment thereof alone or in combination with ACF 
or a fragment thereof. 

Compositions including the liposomes or niosomes in a 
pharmaceutical^ acceptable carrier are also contemplated. 
5 Transdermal delivery devices have been employed for delivery of low 

molecular weight proteins by using lipid-based compositions (i.e., in the form of a 
patch) in combination with sonophoresis. However, as reported in U.S. Patent No. 
6,041,253 to Ellinwood, Jr. et al., which is hereby incorporated by reference in its 
entirety, transdermal delivery can be further enhanced by the application of an electric 

10 field, for example, by iontophoresis or electroporation. Using low frequency 

ultrasound which induces cavitation of the lipid layers of the stratum corneum, higher 
transdermal fluxes, rapid control of transdermal fluxes, and drug delivery at lower 
ultrasound intensities can be achieved. Still further enhancement can be obtained using 
a combination of chemical enhancers and/or magnetic field along with the electric field 

1 5 and ultrasound. 

Implantable or injectable protein depot compositions can also be 
employed, providing long-term delivery of, e.g., the first and second chimeric 
proteins. For example, U.S. Patent No. 6,331,31 1 to Brodbeck et al., which is hereby 
incoiporated by reference in its entirety, reports an injectable depot gel composition 
20 which includes a biocompatible polymer, a solvent that dissolves the polymer and 

forms a viscous gel, and an emulsifying agent in the form of a dispersed droplet phase 
in the viscous gel. Upon injection, such a gel composition can provide a relatively 
continuous rate of dispersion of the agent to be delivered, thereby avoiding an initial 
burst of the agent to be delivered. 
25 Other suitable protein delivery system which are known to those of skill 

in the art can also be employed to achieve the desired delivery and, thus, modification 
in the editing of apolipoprotein B mRNA and its concomitant effects. 

By virtue of the first chimeric protein being able to edit apolipoprotein 
B mRNA, the present invention affords a method of modifying apolipoprotein B 
mRNA editing in vivo. This aspect of the present invention can be carried out by 
contacting apolipoprotein B mRNA in a cell with the first chimeric protein of the 
present invention under conditions effective to increase the concentration of 
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apolipoprotein B48 which is secreted by the cell as compared to the concentration of 
apolipoprotein B100 which is secreted by the cell, relative to an untreated cell (i.e., 
which has not taken up the first chimeric protein). Basically, the contacting is carried 
out by exposing the cell to the first chimeric protein under conditions effective to 
5 induce cellular uptake of the first chimeric protein. Because the first chimeric protein 
includes the first polypeptide (i.e., which includes a protein transduction domain), the 
first chimeric protein is taken up by the cell. In addition, the same cell can also be 
contacted with the second chimeric protein of the present invention, causing the 
second chimeric protein also to be taken up by the cell. As a result, the apolipoprotein 

10 B mRNA in the cell is contacted by the second chimeric protein, binding the 

apolipoprotein mKNA (as described above) so as to facilitate editing thereof by the 
first chimeric protein. The cell in which the apolipoprotein B mRNA editing is 
modified can be any cell which can synthesize and secrete VLDL with apolipoprotein 
B or its derivatives. Exemplary cells of this type include liver cells and intestinal cells, 

1 5 although preferably liver cells. The cell can also be in a mammal, preferably a human. 

Likewise, the present invention also affords a method of reducing serum 
LDL levels. This aspect of the present invention can be carried out by delivering into 
one or more cells of a patient, without genetically modifying the cells, an amount of a 
protein comprising APOBEC-1 or a fragment thereof which can edit mRNA encoding 

20 apolipoprotein B, which amount is effective to increase the concentration of VLDL- 
apolipoprotein B48 that is secreted by the one or more cells into serum and, 
consequently, reduce the serum concentration of LDL. In accordance with this aspect 
of the present invention, the patient is a mammal, preferably a human, and the one or 
more cells are preferably liver cells, intestinal cells, or a combination thereof. 

25 To sustain the reduced serum LDL levels, delivery of the protein into 

the one or more cells is preferably repeated periodically (i.e., following a delay of from 
about 1 to about 7 days). 

Delivery of the protein into the one or more cells can be carried out by 
exposing the one or more cells to the protein under conditions effective to cause 

30 cellular uptake of the protein. Preferably, the protein which includes APOBEC-1 or a 
fragment thereof is actually the first chimeric protein of the present invention and the 
protein transduction domain induces cellular uptake by the one or more cells. In 
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addition to delivering the protein, a second protein can also be delivered 
simultaneously into the one or more cells of the patient, without genetically modifying 
the cells, where the second protein includes ACF or a fragment thereof which can bind 
to apolipoprotein B mRNA. Preferably, the second protein is the second chimeric 
5 protein of the present invention and the protein transduction domain induces cellular 
uptake by the one or more cells. 

Alternatively, APOBEC- 1 can be delivered directly into one or more 
liver cells by contacting each of them with liposomes including a molecule which binds 
to a hepatocyte receptor (e.g., asialofetuin), thereby inducing uptake of the liposomes 

1 0 and degradation thereof intracellular^ to empty their contents into the one or more 
liver cells. In addition, ACF or a fragment thereof which can bind to apolipoprotein B 
mRNA can also be delivered via the liposomes. 

By increasing the ratio of apolipoprotein B48 to apolipoprotein B100 
which is secreted by the one or more cells, the present invention also relates to a 

1 5 method of treating or preventing an atherogenic disease or disorder. This aspect of the 
present invention can be carried out by administering to a patient an effective amount 
of a protein comprising APOBEC- 1 or a fragment thereof which can edit mRNA 
encoding apolipoprotein B, wherein upon said administering the protein is taken up by 
one or more cells of the patient that can synthesize and secrete VLDL-apolipoprotein 

20 under conditions which are effective to increase the concentration of VLDL- 
apolipoprotein B48 that is secreted by the one or more cells into serum, whereby rapid 
clearing of VLDL-apolipoprotein B48 from serum decreases the serum concentration 
of LDL to treat or prevent the atherogenic disease or disorder. In accordance with this 
aspect of the present invention, the patient is a mammal, preferably a human, and the 

25 one or more cells are preferably liver cells. 

Administration of the protein can be carried out according to any of the 
above-identified approaches. Continued preventative or therapeutic treatment can be 
effected by repeatedly administering the APOBEC- 1 protein periodically (i.e., 
following a delay of from about 1 to about 7 days). 

30 Preferably, the protein which includes APOBEC- 1 or a fragment 

thereof is actually the first chimeric protein of the present invention and the protein 
transduction domain induces cellular uptake by the one or more cells. As with the 
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above-described methods, a second protein that includes ACF or a fragment thereof 
which can bind to apolipoprotein B mRNA can also be delivered simultaneously. 
Preferably, the second protein is the second chimeric protein of the present invention 
and the protein transduction domain induces cellular uptake by the one or more cells. 
5 - Alternatively, using a liposome delivery vehicle, APOBEC-1 and 

optionally ACF can be delivered directly into one or more liver cells by contacting each 
of them with a liposome including a molecule which binds to a hepatocyte receptor, 
thereby inducing uptake of the liposomes and degradation thereof intracellular^ to 
empty their contents into the one or more liver cells. 

10 

EXAMPLES 

The following examples are intended to illustrate, but by no means are 
intended to limit, the scope of the present invention as set forth in the appended claims. 

15 

Example 1 - Generation of TAT Fusion Protein 

The induction of hepatic apolipoprotein B mRNA editing was sought 
through TAT mediated APOBEC-1 protein transduction into liver cells. It has been 

20 shown that linking an 1 1 -amino-acid protein transduction domain (PTD) of HIV- 1 
TAT protein to heterologous protein conferred the ability to transduce into cells 
(Nagahara et al., "Transduction of full-length TAT fusion proteins into mammalian 
cells: TAT-p27 Kipl induces cell migration," Nature Med. 4:1449-1452 (1998); 
Schwarze et al., "In vivo protein transduction: delivery of a biologically active protein 

25 into the mouse," Science 285: 1569-1572 (1999); Vocero-Akbani et al., "Killing HIV- 
infected cells by transduction with an HIV protease-activated caspase-3 protein," 
Nature Med. 5:29-33 (1999), each of which is hereby incorporated by reference in its 
entirety). PTD-linked protein transduced into -100% of cells and the transduction 
process occurred in a rapid and concentration-dependent but receptor- and 

30 transporter-independent manner (Schwarze et al., "Protein transduction: unrestricted 
delivery into all cells," Trends Cell Biol. 10:290-295 (2000), which is hereby 
incorporated by reference in its entirety). Liver cells have been shown to be 
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susceptible to transduction (Nagahara et a!., "Transduction of full-length TAT fusion 
proteins into mammalian cells: TAT-p27 Kipl induces cell migration," Nature Med. 
4: 1449-1452 (1998), which is hereby incorporated by reference in its entirety). In 
order to produce in-frame TAT fusion protein from E. coli, a prokaryotic expression 
5 vector was constructed that has an N-terminal PTD flanked by glycine residues for free 
bond rotation of the domain (Schwarze et al., "In vivo protein transduction: delivery of 
a biologically active protein into the mouse," Science 285: 1 569-1572 (1999), which is 
hereby incorporated by reference in its entirety), an hemagglutinin (HA) tag and a C- 
terminal 6-histidine tag. Using this vector as a backbone, a plasmid was constructed to 

10 encode full-length TAT-rAPOBEC-CMPK protein, SEQ ID No: 4 (Figures 2A, 2D, 
and 5A). APOBEC-1 conjugated to CMPK was used in this study because it showed a 
less robust editing activity in vitro and targeted primarily cytoplasmic mRNAs (Yang 
et al., "Induction of cytidine to uridine editing on cytoplasmic apolipoprotein B mRNA 
by overexpressing APOBEC-1," J. Biol. Chem. 275:22663-22669 (2000), which is 

15 hereby incorporated by reference in its entirety). In vitro studies demonstrated that 
APOBEC-1 retained catalytic activity when conjugated to various lengths of non- 
specific proteins (Siddiqui et al., "Disproportionate relationship between APOBEC-1 
expression and apoB mRNA editing activity " Exp. Cell Res. 252:154-164 (1999); 
Yang et al., "Induction of cytidine to uridine editing on cytoplasmic apolipoprotein B 

20 mRNA by overexpressing APOBEC-1," J. Biol. Chem. 275:22663-22669 (2000), each 
of which is hereby incorporated by reference in its entirety). 

A double-stranded oligomeric nucleotide encoding the 9-amino acid 
TAT domain flanked by glycine residues (sense strand shown below, SEQ ID No: 25) 

25 catatgggaa gaaaaaaaag aagacaaaga agaagaggcc tcgag 4 5 

and a PCR product encoding HA-rAPOBEC-CMPK (SEQ ID No: 26 as set forth 
below) 

30 atgggctcta gataccccta cgacgtgccc gactacgccg atatcagttc cgagacaggc 60 
cctgtagctg ttgatcccac tctgaggaga agaattgagc cccacgagtt tgaagtcttc 120 
tttgaccccc gggaacttcg gaaagagacc tgtctgctgt atgagatcaa ctggggagga 180 
aggcacagca tctggcgaca cacgagccaa aacaccaaca aacacgttga agtcaatttc 24 0 
atagaaaaat ttactacaga aagatacttt tgtccaaaca ccagatgctc cattacctgg 300 

35 ttcctgtcct ggagtccctg tggggagtgc tccagggcca ttacagaatt tttgagccga 360 
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tacccccatg taactctgtt tatttatata 

aatcggcaag gactcaggga ccttattagc 

caagagtctg gctactgctg gaggaatttt 

tggccaaggt acccccatct gtgggtgagg 

5 ttaggacttc caccctgttt aaatatttta 

acgattgctc ttcaaagctg ccattaccaa 

gggttgaaag aattccacgc tgccatggca 

gacatcgact ccgagccaac cattgccaga 

gcctcccgct ctgtggacaa gctgaaggaa 

10 ctcaacttct cgcacggcac ccacgagtat 

gccacagaga gctttgcctc tgacccgatc 

accaagggac ctgaaatccg aactggactc 

ctcaagaagg gcgcagctct caaagtgacg 

gagaatgtgc tgtgggtgga ctacaagaac 

15 atctatgtgg atgacggtct catttccttg 

atgactgagg ttgagaacgg tggcatgctt 

gctgcggtcg acctgcctgc agtctcagag 

gagcagaatg tggacatggt gttcgcttcc 

gtcaggaagg tgctagggga aaagggaaag 

20 cacgagggtg tgcgcaggtt tgatgagatc 

cgtggtgacc tgggtattga gatccctgct 

attgggcgct gcaacagggc tggcaaaccc 

atgatcaaga aacctcgccc gacccgcgct 

gatggagcag actgcatcat gctgtctggg 

25 gctgtgcgca tgcagcacgc tattgctcgt 

cagtttgaag aaatcttacg ccacagtgta 

gcaggcgcgg tggaggcctc ctttaagtgc 

tctggcaggt ctgcacacct ggtgtcccgg 

acccgcaatg accaaacagc acgccaggca 

30 tgcaagcagc cggcccacga tgcctgggca 

atgaatgtcg gcaaagcccg tggattcttc 

ggctggcgcc ccggctccgg ctacaccaac 



gcacggcttt atcaccacgc agatcctcga 420 
agcggtgtta ctatccagat catgacggag 480 
gtcaactact ccccttcgaa tgaagctcat 540 
ctgtacgtac tggaactcta ctgcatcatt 600 
agaagaaaac aacctcaact cacgtttttc 660 
aggctaccac cccacatcct gtgggccaca 720 
gacacctttc tggagcacat gtgccgcctg 780 
aacaccggca tcatctgcac catcggccca 840 
atgattaaac ctggaatgaa tgttgcccgc 900 
catgagggca caattaagaa cgtgcgagag 960 
acctacagac ctgtggctat tgcactggac 1020 
atcaagggaa gtggcacagc agaggtggag 1080 
ctggacaatg ccttcatgga gaactgcgat 1140 
ctcatcaaag ttatagatgt gggcagcaaa 1200 
ctggttaagg agaaaggcaa ggactttgtc 1260 
ggtagtaaga agggagtgaa cctcccaggt 1320 
aaggacattc aggacctgaa atttggcgtg 1380 
ttcatccgca aagctgctga tgtccatgct 1440 
cacatcaaga ttatcagcaa gattgagaat 1500 
atggaggcca gcgatggcat tatggtggcc 1560 
gaaaaagtct tcctcgcaca gaagatgatg 1620 
atcatttgtg ccactcagat gttggaaagc 1680 
gagggcagtg atgttgccaa tgcagttctg 1740 
gagaccgcca agggagacta cccactggag 1800 
gaggctgagg ccgcaatgtt ccatcgtcag 1860 
caccacaggg agcctgctga tgccatggca 1920 
ttagcagcag ctctgatagt tatgaccgag 1980 
taccgcccgc gggctcccat catcgccgtc 2040 
cacctgtacc gcggcgtctt ccccgtgctg 2100 
gaggatgtgg atctccgtgt gaacctgggc 2160 
aagaccgggg acctggtgat cgtgctgacg 222 0 
accatgcggg tggtgcccgt gcca 2274 



or HA-CMPK (SEQ ID No: 27 as set forth below) 

35 

ctcgagatgt acccctacga cgtgcccgac tacgccgata tccacgctgc catggcagac 60 

acctttctgg agcacatgtg ccgcctggac atcgactccg agccaaccat tgccagaaac 120 

accggcatca tctgcaccat cggcccagcc tcccgctctg tggacaagct gaaggaaatg 180 

attaaatctg gaatgaatgt tgcccgcctc aacttctcgc acggcaccca cgagtatcat 240 

40 gagggcacaa ttaagaacgt gcgagaggcc acagagagct ttgcctctga cccgatcacc 300 

tacagacctg tggctattgc actggacacc aagggacctg aaatccgaac tggactcatc 360 

aagggaagtg gcacagcaga ggtggagctc aagaagggcg cagctctcaa agtgacgctg 420 

gacaatgcct tcatggagaa ctgcgatgag aatgtgctgt gggtggacta caagaacctc 480 

atcaaagtta tagatgtggg cagcaaaatc tatgtggatg acggtctcat ttccttgctg 540 

45 gttaaggaga aaggcaagga ctttgtcatg actgaggttg agaacggtgg catgcttggt 600 

agtaagaagg gagtgaacct cccaggtgct gcggtcgacc tgcctgcagt ctcagagaag 660 

gacattcagg acctgaaatt tggcgtggag cagaatgtgg acatggtgtt cgcttccttc 720 

atccgcaaag ctgctgatgt ccatgctgtc aggaaggtgc taggggaaaa gggaaagcac 780 

atcaagatta tcagcaagat tgagaatcac gagggtgtgc gcaggtttga tgagatcatg 840 

50 gaggccagcg atggcattat ggtggcccgt ggtgacctgg gtattgagat ccctgctgaa 900 

aaagtcttcc tcgcacagaa gatgatgatt gggcgctgca acagggctgg caaacccatc 960 

atttgtgcca ctcagatgtt ggaaagcatg atcaagaaac ctcgcccgac ccgcgctgag 1020 

ggcagtgatg ttgccaatgc agttctggat ggagcagact gcatcatgct gtctggggag 1080 

accgccaagg gagactaccc actggaggct gtgcgcatgc agcacgctat tgctcgtgag 1140 

55 gctgaggccg caatgttcca tcgtcagcag tttgaagaaa tcttacgcca cagtgtacac 1200 

cacagggagc ctgctgatgc catggcagca ggcgcggtgg aggcctcctt taagtgctta 1260 

gcagcagctc tgatagttat gaccgagtct ggcaggtctg cacacctggt gtcccggtac 1320 

cgcccgcggg ctcccatcat cgccgtcacc cgcaatgacc aaacagcacg ccaggcacac 1380 

ctgtaccgcg gcgtcttccc cgtgctgtgc aagcagccgg cccacgatgc ctgggcagag 1440 

60 gatgtggatc tccgtgtgaa cctgggcatg aatgtcggca aagcccgtgg attcttcaag 1500 

accggggacc tggtgatcgt gctgacgggc tggcgccccg gctccggcta caccaacacc 1560 

atgcgggtgg tgcccgtgcc atgactcgag 1590 
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(Yang et al., "Induction of cytidine to uridine editing on cytoplasmic apolipoprotein B 
mRNA by overexpressing APOBEC-1," J. Biol. Chem. 275:22663-22669 (2000), 
which is hereby incorporated by reference in its entirety) were inserted into NdeVXhoi 
digested pPROEX vector (Life, Gaithersburg, Maryland). The entire constructs (TAT- 
5 rAPOBEC-CMPK (SEQ ID No: 3) or TAT-CMPK (SEQ ID No: 28 as set forth 
below) 



catatgggaa gaaaaaaaag aagacaaaga agaagaggcc tcgagatgta cccctacgac 60 

gtgcccgact acgccgatat ccacgctgcc atggcagaca cctttctgga gcacatgtgc 120 

10 cgcctggaca tcgactccga gccaaccatt gccagaaaca ccggcatcat ctgcaccatc 180 

ggcccagcct cccgctctgt ggacaagctg aaggaaatga ttaaatctgg aatgaatgtt 240 

gcccgcctca acttctcgca cggcacccac gagtatcatg agggcacaat taagaacgtg 300 

cgagaggcca cagagagctt tgcctctgac ccgatcacct acagacctgt ggctattgca 360 

ctggacacca agggacctga aatccgaact ggactcatca agggaagtgg cacagcagag 42 0 

15 gtggagctca agaagggcgc agctctcaaa gtgacgctgg acaatgcctt catggagaac 480 

tgcgatgaga atgtgctgtg ggtggactac aagaacctca tcaaagttaC agatgtgggc 540 

agcaaaatct atgtggatga cggtctcatt tccttgctgg ttaaggagaa aggcaaggac 600 

tttgtcatga ctgaggttga gaacggtggc atgcttggta gtaagaaggg agtgaacctc 660 

ccaggtgctg cggtcgacct gcctgcagtc tcagagaagg acattcagga cctgaaattt 720 

20 ggcgtggagc agaatgtgga catggtgttc gcttccttca tccgcaaagc tgctgatgtc 780 

catgctgtca ggaaggtgct aggggaaaag ggaaagcaca tcaagattat cagcaagatt 840 

gagaatcacg agggtgtgcg caggtttgat gagatcatgg aggccagcga tggcattatg 900 

gtggcccgtg gtgacctggg tattgagatc cctgctgaaa aagtcttcct cgcacagaag 960 

atgatgattg ggcgctgcaa cagggctggc aaacccatca tttgtgccac tcagatgttg 1020 

25 gaaagcatga tcaagaaacc tcgcccgacc cgcgctgagg gcagtgatgt tgccaatgca 1080 

gtCctggatg gagcagactg catcatgctg tctggggaga ccgccaaggg agactaccca 1140 

ctggaggctg tgcgcatgca gcacgctatt gctcgtgagg ctgaggccgc aatgttccat 1200 

cgtcagcagt ttgaagaaat cttacgccac agtgtacacc acagggagcc tgctgatgcc 1260 

atggcagcag gcgcggtgga ggcctccttt aagtgcttag cagcagctct gatagttatg 1320 

30 accgagtctg gcaggtctgc acacctggtg tcccggtacc gcccgcgggc tcccatcatc 1380 

gccgtcaccc gcaatgacca aacagcacgc caggcacacc tgtaccgcgg cgtcttcccc 1440 

gtgctgtgca agcagccggc ccacgatgcc tgggcagagg atgtggatct ccgtgtgaac 1500 

ctgggcatga atgtcggcaa agcccgtgga ttcttcaaga ccggggaccC ggtgatcgtg 1560 

ctgacgggct ggcgccccgg ctccggctac accaacacca tgcgggtggt gcccgtgcca 1620 

35 tgactcgag 1629 

were inserted into p£T-24b (Novagen, Madison, Wisconsin) vector to take advantage 
of the C-terminal His 6 tag. TAT fusion proteins (referred to as TAT-CMPK, the 
expression product of SEQ ID No: 28, and TAT-rAPOBEC-CMPK, SEQ ID No: 4) 

40 were purified from BL-21(DE3) codon plus cells (Stratagene, La Jolla, California). 
Two to four 1 -liter cultures were inoculated with a 10 ml overnight culture each and 
induced by 0. 1 mM IPTG at 30°C for 1 hour. Soluble proteins were obtained by 
French press in 25 ml of buffer A (8M urea, 10 mM Tris pH 8, 100 mM NaH 2 P0 4 ). 
Cellular lysates were cleared by centrifugation, loaded onto a 5-ml Ni-NTA column 

45 (Qiagen, Valencia, California) in buffer A with 10-20 mM imidazole, washed and 

eluted with imidazole in buffer A 'stepwise' (100, 175 and 250 mM) and loaded onto a 
HiTrap SP column (Amersham Pharmacia, Piscataway, New Jersey). The column was 
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washed and eluted with 1 M NaCl in buffer A. The urea and high salt were removed 
from the relevant fractions by rapid dialysis against buffer B (30 mM Tris pH=8.5, 50 
mM NaCl, 10|iM zinc acetate, 5% glycerol). The elution profile was analyzed by 
SDS-PAGE. Gels were stained with silver according to manufacture's 
5 recommendations (Bio-Rad, Hercules, California). 

Recombinant proteins were solubilized in 8M urea buffer from bacterial 
cells so as to maximize their yield from inclusion bodies. Previous studies have shown 
that denatured proteins could transduce as well as native proteins (Schwarze et al., "In 
vivo protein transduction: delivery of a biologically active protein into the mouse," 

10 Science 285: 1569-1572 (1999), which is hereby incorporated by reference in its 

entirety). The proteins were purified through metal-chelating affinity chromatography 
followed by cationic exchange chromatography. The urea was removed by rapid 
dialysis and the purity of full-length 86 kDa TAT-rAPOBEC-CMPK, SEQ ID No: 4, 
was apparent as shown by silver staining (Figure 5B). The purification of full-length 

15 protein was also confirmed by western blot using anti-His 6 antibody. 

Example 2 - In vitro Introduction of TAT-rAPOBEC-CMPK 
into McArdle Cells 

20 The uptake of TAT-rAPOBEC-CMPK, SEQ ID No: 4, into McArdle 

cells was evaluated using an antibody reactive with the HA epitope and fluorescence 
microscopy. 

McArdle RH7777 cells were obtained from ATCC (Manassas, Virginia) 
and cultured as described previously (Yang et al., "Partial characterization of the 

25 auxiliary factors involved in apo B mRNA editing through APOBEC-1 affinity 
chromatography," J. Biol. Chem. 272:27700-27706 (1997), which is hereby 
incorporated by reference in its entirety). McArdle cells, grown on six well cluster 
plates were treated with either TAT-rAPOBEC-CMPK or TAT-CMPK for the 
indicated times. Cells were then washed extensively with PBS and subsequently fixed 

30 with 2% paraformaldehyde, permeabilized with 0.4% Triton X 1 00, blocked with 1 % 
BSA and reacted with affinity purified anti-HA (Babco, Berkeley, CA) and affinity 
purified FITC conjugated goat anti-mouse secondary antibody (Organon Teknika, 
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West Chester, PA), each at 1: 1000 dilution. Fluorescence was observed and electronic 
images captured on an inverted, fluorescence Olympus microscope. 

Recombinant APOBEC-1 has a tendency to aggregate, a property 
which persists in TAT-rAPOBEC-CMPK, apparent as aggregates of HA antibody- 
5 reactive material attached to the surface of cells Ih following the addition of the 
protein to the media (Figures 6A-B). Aggregation was not a property of the TAT 
motif or CMPK as control protein (TAT-CMPK) at a higher molar concentration 
appeared as an array of speckles attached to the surface of McArdle cells 1 h following 
its addition to the media (Figures 7 A and B). 

1 0 Within 6 h following treatment, both TAT-rAPOBEC-CMPK (Figures 

6C-D) and TAT-CMPK (Figures 7C-D) were apparent inside the cells and the cell 
surface-attached aggregates appeared to be more disperse. Following 24 h of 
treatment, many of the cells treated with TAT-rAPOBEC-CMPK demonstrated bright 
perinuclear fluorescence and also a low intensity of fluorescence throughout the 

1 5 nucleus and cytoplasm (Figures 6E-F). Cells treated for 24 h with TAT-CMPK 
demonstrated bright fluorescent speckles in the cytoplasm and fainter homogenous 
nuclear fluorescence (Figure 7E-F). The nuclear distribution of the recombinant 
protein might have been facilitated by the embedded nuclear localization signal (NLS) 
in TAT sequence (Schwarze et al., "In vivo protein transduction: delivery of a 

20 biologically active protein into the mouse," Science 285: 1569-1572 (1999), which is 
hereby incorporated by reference in its entirety) as APOBEC-1 alone does not have a 
functional NLS (Yang et al., "Multiple protein domains determine the cell type-specific 
nuclear distribution of the catalytic subunit required for apo B mRNA editing," Proc. 
Natl. Acad. Sci. USA 94:13075-13080 (1997), which is hereby incorporated by 

25 reference in its entirety) and 6His-HA-APOBEC-CMPK was excluded from the 
nucleus (Yang et al., "Induction of cytidine to uridine editing on cytoplasmic 
apolipoprotein B mRNA by overexpressing APOBEC-1," J. Biol. Chem. 275:22663- 
22669 (2000), which is hereby incorporated by reference in its entirety). The data 
suggested that both TAT-rAPOBEC-CMPK and TAT-CMPK were taken up by 

30 McArdle cells. Comparatively, the efficiency of TAT-rAPOBEC-CMPK uptake was 
poorer than that for TAT-CMPK, and the distribution of these proteins within the cells 
appeared different. 
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Example 3 - Measurement of Apolipoprotein B mRNA Editing in TAT- 
rAPOBEC-CMPK Transduced McArdle CeUs 

5 Given that TAT-CMPK entered McArdle cells, as demonstrated in 

Example 2, an evaluation was made as to whether this would affect apolipoprotein B 
mRNA editing activity (Figure 8). Cells were treated with the indicated amounts of 
TAT-CMPK (using the same preparation of protein as in Figure 7) and total cellular 
RNA was isolated following 24 h and the proportion of edited apolipoprotein B 

1 0 mRNA measured. 

Total cellular RNA was isolated from cells with Tri-Reagent (Molecular 
Research Center, Cincinnati, Ohio) according to manufacture's recommendations. 
Purified RNAs were digested with RQ-DNase I (Promega, Madison, Wisconsin) and 
with Rsal (Promega) restriction enzyme that has a recognition site between the PCR 

1 5 annealing sites of target substrates to ensure the removal of the contaminating genomic 
DNA. 

Editing activity was determined by the reverse transcriptase-polymerase 
chain reaction (RT-PCR) methodology described previously (Smith et al. "In vitro 
apolipoprotein B mRNA editing: Identification of a 27S editing complex," Proc. Natl. 
20 Acad. Sci. USA 88: 1489-1493 (1991), which is hereby incorporated by reference in its 
entirety). First strand cDNA was generated using oligo dT-primed total cellular RNA. 
Specific PCR amplification of rat apolipoprotein B sequence surrounding the editing 
site was accomplished using ND1/ND2 primer pairs set forth below: 

25 NDi (SEQ ID No: 29) 

atctgactgg gagagacaag tag 23 

ND2 (SEQ ID No: 30) 

gttcttttta agtcctgtgc ate 23 

30 

PCR products were gel isolated and the editing efficiency was 
determined by poisoned primer extension assay using 32 P ATP (NEN, Boston, 
Massachusetts) end-labeled DD3 primer (SEQ ID No: 31) as follows: 
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aatcatgtaa atcataacta tctttaatat actga 35 

under high concentration of dideoxy GTP as described previously (Smith et al. "In 
5 vitro apolipoprotein B mRNA editing: Identification of a 27S editing complex," Proc. 
Natl. Acad. Sci. USA 88:1489-1493 (1991); Sowden et al., "Overexpression of 
APOBEC-1 results in mooring-sequence-dependent promiscuous RNA editing," I 
Biol. Chem. 271:301 1-3017 (1996), each of which is hereby incorporated by reference 
in its entirety). Primer extension products were resolved on a 10% denaturing 

10 polyacrylamide gel, autoradiographed, and then quantified by a laser densitometric 

scanning (Molecular Dynamics, Sunnyvale, California). Percent editing was calculated 
as the counts in the UAA (edited) band divided by the sum of the counts in UAA and 
those in the CAA (unedited) bands and multiplied by 100. 

No change in the percent editing of apolipoprotein B mRNA relative to 

1 5 untreated cells (see Figure 9) was observed with TAT-CMPK concentrations ranging 
from 45 to 1 125 nM (5 to 133 ng protein/ml of media) (Figure 8). 

In contrast, editing activity increased in McArdle cells with 360 nM (62 
Hg protein/ml media) TAT-rAPOBEC-CMPK following 6 h and continued to a peak 
by 24 h, a more than 3-fold increase over the level of editing observed in control cells 

20 (Figure 9). The proportion of edited RNA remained elevated up to 48 h after 

treatment (Figure 9) and approached baseline by 72 h. It has been reported that the 
enzymatic activity lagged the appearance of the transduced protein inside the cells, 
probably due to a slow refolding of the transduced protein (Schwarze et al., "In vivo 
protein transduction: delivery of a biologically active protein into the mouse," Science 

25 285: 1 569- 1 572 ( 1 999), which is hereby incorporated by reference in its entirety). 

Taken together, the results demonstrated that TAT-rAPOBEC-CMPK transduced into 
McArdle cells, refolded into an enzymatically active conformation over the first 6 hr 
and then edited apolipoprotein B mRNA. The reduction in the proportion of edited 
apolipoprotein B mRNA after 48 hr was likely due to enzyme inactivation and 

30 apolipoprotein B mRNA turnover. This characteristic was important as it 

demonstrated the transient and reversible nature of the protein transduction system. 
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Example 4 - In vitro Introduction of TAT-rAPOBEC-CMPK into Primary 
Hepatocytes 

To determine if the results obtained using McArdle cells would be 
5 applicable in primary liver cells, cultured rat primary hepatocytes were prepared and 
then treated with TAT-rAPOBEC-CMPK. The rat primary hepatocytes were prepared 
from unfasted, male Sprague-Dawley rats (250-275 g body weight, Taconic Farm) fed 
ad libitum normal rat chow as described previously (Van Mater et al., "Ethanol 
increases apolipoprotein B mRNA editing in rat primary hepatocytes and McArdle 

10 cells," Biochem. Biophvs. Res. Comm. 252:334-339 (1998), which is hereby 

incorporated by reference in its entirety). Recombinant TAT fusion protein was added 
directly to the cell culture media after dialysis. 

It has been shown that the editing efficiency in primary rat hepatocytes 
decreased as a result of proliferation after 72 hours in culture (Van Mater et al, 

1 5 "Ethanol increases apolipoprotein B mRNA editing in rat primary hepatocytes and 

McArdle cells," Biochem. Biophvs. Res. Comm. 252:334-339 (1998), which is hereby 
incorporated by reference in its entirety). Together with the fact that TAT-rAPOBEC- 
CMPK maximally increased editing 24 hours after treatment in McArdle cells, a 
decision was made to evaluate dose response for a fixed time rather than study 

20 kinetics. Primary hepatocytes were treated with the indicated amounts of TAT- 
rAPOBEC-CMPK and analyzed for edited apolipoprotein B mRNA 24 hours 
afterwards. Analysis of apolipoprotein B mRNA was carried out a described in 
Example 3 above. 

The editing activity of hepatocytes increased in proportion to the 

25 amount of TAT-rAPOBEC-CMPK added to the cell culture media relative to cells 
treated with buffer alone (Figure 10) or treated with TAT-CMPK (Figure 8). Given 
that the primary hepatocytes were seeded at the same cell number as McArdle cells, a 
comparison of the data in Figures 9 and 10 suggested that TAT-rAPOBEC-CMPK 
was more effective in inducing editing activity in the primary cell culture. This was true 

30 for several preparations of recombinant protein and primary cells and, therefore, the 
difference may be due to the fact that the primary hepatocytes have a higher baseline of 
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editing than McArdle cells (48% versus 7%) and/or may be "primed" with more 
auxiliary factors. 

Promiscuous editing of additional cytidines in rat apolipoprotein B 
mRNA of transfected cells (Sowden et al, "Overexpression of APOBEC-1 results in 
5 mooring-sequence-dependent promiscuous RNA editing," J. Biol. Chem. 27 1 : 30 1 1 - 
3017 (1996); Yamanaka et al., "Hyperediting of multiple cytidines of apolipoprotein B 
mRNA by APOBEC-1 requires auxiliary protein(s) but not a mooring sequence motif," 
J. Biol. Chem. 271:1 1506-1 1510 (1996), Sowden et al., "Apolipoprotein B RNA 
Sequence 3' of the mooring sequence and cellular sources of auxiliary factors 

1 0 determine the location and extent of promiscuous editing," Nucleic Acids Res. 

26: 1644-1652 (1998), each of which is hereby incorporated by reference in its entirety) 
or hyper-editing of other mRNAs in transgenic mice and rabbits (Yamanaka et al., 
"Hyperediting of multiple cytidines of apolipoprotein B mRNA by APOBEC-1 
requires auxiliary protein(s) but not a mooring sequence motif," J. Biol. Chem. 

15 271:1 1506-1 1510 (1996); Yamanaka et al., "A novel translation^ repressor mRNA is 
edited extensively in livers containing tumors caused by the transgene expression of the 
apoB mRNA editing enzyme," Genes & Dev. 1 1:321-333 (1997), each of which is 
hereby incorporated by reference in its entirety) has been observed in response to very 
high levels of APOBEC-1 expression. Editing of cytidines 5' of the wild type editing 

20 site (C6666) was a bellwether for the loss of editing site fidelity in rat cells and could 
be used to monitor the induction of promiscuous editing in relation to changes in 
APOBEC-1 expression (Sowden et al., "Apolipoprotein B RNA Sequence 3' of the 
mooring sequence and cellular sources of auxiliary factors determine the location and 
extent of promiscuous editing," Nucleic Acids Res. 26: 1644-1652 (1998); Siddiqui et 

25 al., "Disproportionate relationship between APOBEC-1 expression and apoB mRNA 
editing activity," Exp. Cell Res. 252: 1 54-164 (1999), each of which is hereby 
incorporated by reference in its entirety). Promiscuous editing of cytidine 3* C6666 in 
apolipoprotein B mRNA did not occur to a significant extent in rat cells and 
hyperediting of mRNAs other than apolipoprotein B was not a characteristic of 

30 APOBEC-1 overexpression in rat cells (Sowden et al., "Apolipoprotein B RNA 
Sequence 3* of the mooring sequence and cellular sources of auxiliary factors 
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determine the location and extent of promiscuous editing," Nucleic Acids Res. 
26: 1644-1652 (1998), which is hereby incorporated by reference in its entirety). 

Despite the high level of editing activity in treated primary hepatocytes, 
promiscuous editing (evident as additional primer extension products above UAA 
5 (Sowden et al., "Determinants involved in regulating the proportion of edited 

apolipoprotein B RNAs," RNA 2:274-288 (1996); Sowden et al., "Apolipoprotein B 
RNA Sequence 3' of the mooring sequence and cellular sources of auxiliary factors 
determine the location and extent of promiscuous editing," Nucleic Acids Res. 
26: 1644-1652 (1998), each of which is hereby incorporated by reference in its entirety) 

10 was not observed (Figure 10). Given that our detection limit for promiscuous editing 
was 0.3% (Sowden et al., "Determinants involved in regulating the proportion of 
edited apolipoprotein B RNAs," RNA 2:274-288 (1996), which is hereby incorporated 
by reference in its entirety) the data suggested that TAT-rAPOBEC-CMPK could be 
used to substantially increase site-specific editing of apolipoprotein B mRNA without 

1 5 significant loss of fidelity of the reaction. 

Example 5 - Analysis of Secreted Lipoprotein Products by Transduced 
Primary Hepatocytes 

20 To further confirm the efficacy of this method, secreted apolipoprotein 

B protein was evaluated in primary rat hepatocytes that were long-term metabolically 
labeled with [ 35 S]-methionine and [ 35 S]-cysteine after TAT-rAPOBEC-CMPK 
treatment. 

Twelve to eighteen hour rat primary hepatocytes grown in Waymouth's 
25 752/1 media (Sigma, St. Louis, MO) were treated for 1 1 hours with TAT-rAPOBEC- 
CMPK and then incubated for 1 hour in DMEM deficient medium (without 
methionine, cysteine and L-glutamine) (Sigma, St. Louis, MO) containing 0.2% (w/v) 
BSA, 0.1 nM insulin, 100 |ig/ml streptomycin and 50 jig/ml gentamicin. The medium 
was replaced with fresh labeling medium containing 0.7jiCi/ml L-[ 35 S]-Methionine and 
30 L-[ 35 S]-Cysteine using EXPRE 33 S 35 S protein labeling mix (NEN, Boston, 

Massachusetts). Cells were incubated in the labeling medium for 30 minutes. One 
volume of Waymouth's medium with cold cysteine and methionine was added to cells 



WO 02/068676 



PCTAJS02/05824 



-50- 

and the labeling continued for an additional 12 hours, after which cell culture medium 
was collected for the isolation and analysis of secreted apolipoprotein B protein and 
RNAs. (RNA analysis was conducted as in Example 3 above.) 

Immunoprecipitation of apolipoprotein B from cell culture medium was 
5 performed as described previously (Sparks et al., "Insulin-mediated inhibition of 
apolipoprotein B secretion requires an intracellular trafficking event and 
phosphatidylinositol 3-kinase activation: studies with brefeldin A and wortmannin in 
primary cultures of rat hepatocytes," Biochem. J. 313:567-574 (1996), which is 
hereby incorporated by reference in its entirety). A rabbit polyclonal antibody raised 

10 against rat apolipoprotein B and reactive with the N-terminus of apolipoprotein B100 
and apolipoprotein B48 (obtained from Drs. J.D. Sparks and C.E. Sparks, University 
of Rochester) was used to precipitate apolipoprotein B. The immunoprecipitants were 
separated by SDS-PAGE on 5% gel. The gel was dried and exposed to film to reveal 
the secreted apolipoprotein B containing lipoprotein profile which represents the 

1 5 secreted apolipoprotein B48 and apolipoprotein Bl 00 during the 12 hour labeling 
period. 

The secreted [ 35 S]-labeled apolipoprotein B lipoproteins were isolated 
from the cell culture media exposed to cells for 12 hours followed by 
immunoprecipitation, and analyzed by autoradiography after SDS-PAGE separation. 

20 The signal on the gel was in direct proportion to the number of cysteine and 
methionine residues in apolipoprotein B100 and apolipoprotein B48. Since 
apolipoprotein B48 was the N-terminal 48% of apolipoprotein B100, stronger signal 
was expected from apolipoprotein B 100 in control cells. However, as the editing 
efficiency approached 90% due to TAT-rAPOBEC-CMPK treatment, an increasing 

25 amount of apolipoprotein B48 was secreted, and apolipoprotein B100 became almost 
undetectable (Figure 11). Thus, lowering apolipoprotein B100 associated atherogenic 
risk factors through precisely controlled hepatic apolipoprotein B mRNA editing was 
achievable by protein transduction with TAT-rAPOBEC-CMPK. 
Discussion of Examples 1-5 

30 It is believed that the present invention offers a novel approach to 

curtail hepatic output of apolipoprotein B100 associated atherogenic factors through 
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up-regulating apolipoprotein B mRNA editing by using protein transduction into target 
(e.g., liver) cells. The PTD, amino acid residues 49-57, of HTV-1 TAT protein has 
been used in other systems to deliver functional full-length protein molecules into cells 
(Nagahara et al., "Transduction of full-length TAT fusion proteins into mammalian 
5 cells: TAT-p27 Kipl induces cell migration," Nature Med. 4: 1449-1452 (1998); 

Schwarze et al., "In vivo protein transduction: delivery of a biologically active protein 
into the mouse," Science 285: 1569-1572 (1999); Vocero-Akbani et al, "Killing HIV- 
infected cells by transduction with an HIV protease-activated caspase-3 protein," 
Nature Med. 5:29-33 (1999), each of which is hereby incorporated by reference in its 

10 entirety). Some of these fusion molecules, when introduced into mice, entered all 
tissue cells, even crossing the blood brain barrier (Schwarze et al., "In vivo protein 
transduction: delivery of a biologically active protein into the mouse," Science 
285. 1 569- 1 572 ( 1 999), which is hereby incorporated by reference in its entirety). 
Although the detailed mechanism for the cellular uptake of the fusions remains 

1 5 unknown, denaturing of the protein during membrane transduction is thought to be a 
rapid process and the rate limiting event is the renaturing of the transduced protein 
once inside of cells (Schwarze et al., "Protein transduction: unrestricted delivery into 
all cells," Trends Cell Biol. 10:290-295 (2000), which is hereby incorporated by 
reference in its entirety). 

20 In this regard, the protein transduction method may have limitations in 

that some proteins may not be able successfully to adopt an active conformation after 
they have been unfolded. It is significant, therefore, that the above Examples 
demonstrate that both TAT-CMPK (expression product of SEQ ID No: 28) and TAT- 
rAPOBEC-CMPK (SEQ ID No: 4) had the capacity to enter hepatocytes and that 

25 TAT-rAPOBEC-CMPK activated editing within 6 hours of its addition to the media. 
Similar kinetics have been observed with TAT-rAPOBEC-CMPK prepared under 
native conditions. 

Importantly, TAT-CMPK could not stimulate editing activity, 
demonstrating that the observed changes in editing were specific to APOBEC-1 

30 containing recombinant proteins. Considering the tendency for APOBEC-1 containing 
proteins to aggregate, part of the lag in entering cells could have been due to the 
inability of these multimeric complexes to cross the plasma membrane and the time it 
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took for TAT-rAPOBEC-CMPK monomers to dissociate from the aggregates and 
cross the membrane. This is supported by the finding that TAT-CMPK, which did not 
appear to form large aggregates, appeared to accumulate within the cells with more 
rapid kinetics than that observed for TAT-rAPOBEC-CMPK. The six hour lag before 
5 an increase in editing activity could be measured may have also been due to the time 
required for the transduced protein to refold and assemble editosomes. 

Apolipoprotein B mRNA editing occurs in the cell nucleus despite the 
fact that editing factors can also be demonstrated in the cytoplasm (Yang et al., 
"Induction of cytidine to uridine editing on cytoplasmic apolipoprotein B mRNA by 

10 overexpressing APOBEC-1 ," J. Biol. Chem. 275:22663-22669 (2000), which is hereby 
incorporated by reference in its entirety). The mechanism responsible for APOBEC- 
1 *s distribution in the nucleus is not understood (Yang et al, "Intracellular Trafficking 
Determinants in APOBEC-1, the Catalytic Subunit for Cytidine to Uridine Editing of 
ApoB mRNA," Exp. Cell Res. 267: 163-184 (2001), which is hereby incorporated by 

1 5 reference in its entirety), however its mass appeared to be important as the chimeric 
protein APOBEC-CMPK was excluded from the nucleus (Yang et al., "Multiple 
protein domains determine the cell type-specific nuclear distribution of the catalytic 
subunit required for apo B mRNA editing," Proc. Natl. Acad. Sci. USA 94: 13075- 
13080 (1997); Yang et al., "Induction of cytidine to uridine editing on cytoplasmic 

20 apolipoprotein B mRNA by overexpressing APOBEC-1," J. Biol. Chem. 275:22663- 
22669 (2000), each of which is hereby incorporated by reference in its entirety). TAT- 
rAPOBEC-CMPK's ability to distribute in both the cytoplasm and the nucleus was 
consistent with the proposed ability of the TAT PTD to act also as a nuclear 
localization signal (Schwarze et al., "In vivo protein transduction: delivery of a 

25 biologically active protein into the mouse," Science 285: 1569-1 572 (1 999), which is 
hereby incorporated by reference in its entirety). Although TAT-rAPOBEC-CMPK's 
distribution mimicked that of the wild type enzyme's distribution (Yang et al., 
"Multiple protein domains determine the cell type-specific nuclear distribution of the 
catalytic subunit required for apo B mRNA editing," Proc. Natl. Acad. Sci. USA 

30 94:13075-13080 (1997), which is hereby incorporated by reference in its entirety), 
uncertainty remains as to whether all of the transduced TAT-rAPOBEC-CMPK 
molecules were active in editing, as well as whether cytoplasmic or nuclear transcripts 
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were edited. Nonetheless, regardless of the degree of activity or its localization within 
the cell, a positive reduction in apolipoprotein B100 lipoprotein was demonstrated. 

Enhancement of editing activity by overexpression of APOBEC-1 
through gene transfer has been shown to be associated with promiscuous editing on 
5 both nuclear and cytoplasmic transcripts (Sowden et al., "Overexpression of 

APOBEC-1 results in mooring-sequence-dependent promiscuous KNA editing," I 
Biol. Chem. 271:301 1-3017 (1996); Yang et al., "Induction of cytidine to uridine 
editing on cytoplasmic apolipoprotein B mRNA by overexpressing APOBEC-1, " 1 
Biol. Chem. 275:22663-22669 (2000), each of which is hereby incorporated by 

10 reference in its entirety). Metabolic stimulation of apolipoprotein B mRNA editing 
always retained fidelity (Wu et al., "ApoB mRNA editing: validation of a sensitive 
assay and developmental biology of RNA editing in the rat," J. Biol. Chem. 
265:12312-12316 (1990); Greeve et al., "Apolipoprotein B mRNA editing in 12 
different mammalian species: hepatic expression is reflected in low concentrations of 

15 apoB-containing plasma lipoproteins," J. Lipid Res. 34:1367-1383 (1993); Phung et 
al., "Regulation of hepatic apoB RNA editing in the genetically obese Zucker rat," 
Metabolism 45: 1056-1058 (1996); von Wronski et al., "Insulin increases expression of 
apobec-1, the catalytic subunit of the apoB B mRNA editing complex in rat 
hepatocytes," Metabolism Clinical & Exp. 7:869-873 (1998), each of which is hereby 

20 incorporated by reference in its entirety). It is highly significant, therefore, that the 
fidelity of the editing activity was retained with TAT-rAPOBEC-CMPK even when 
editing was enhanced to >90%. This level of high fidelity editing could not be 
achieved without hyper-editing in apobec-1 transgenic animals (Yamanaka et al., 
"Hyperediting of multiple cytidines of apolipoprotein B mRNA by APOBEC-1 

25 requires auxiliary protein(s) but not a mooring sequence motif," J. Biol. Chem. 

271: 1 1506-1 1510 (1996); Yamanaka et al., "A novel translational repressor mRNA is 
edited extensively in livers containing tumors caused by the transgene expression of the 
apoB mRNA editing enzyme," Genes & Dev. 1 1 : 32 1-333 (1997); Sowden et al, 
"Overexpression of APOBEC-1 results in mooring-sequence-dependent promiscuous 

30 RNA editing," J. Biol. Chem. 271:301 1-3017 (1996); Sowden et al., "Apolipoprotein 
B RNA Sequence 3' of the mooring sequence and cellular sources of auxiliary factors 
determine the location and extent of promiscuous editing," Nucleic Acids Res. 
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26: 1644-1652 (1998); each of which is hereby incorporated by reference in its 
entirety). There was no pathology in transgenic animals in which induction of hepatic 
apolipoprotein B mRNA editing was achieved at a low level of apobec-1 expression 
and these animals had a markedly lower semm apolipoprotein B100 and significantly 
5 reduced serum LDL compared to controls (Teng et al, "Adeno virus-mediated gene 
transfer of rat apolipoprotein B mRNA editing protein in mice virtually eliminates 
apolipoprotein B-100 and normal low density lipoprotein production," J. Biol. Chem. 
269:29395-29404 (1994); Hughs et al., "Gene transfer of cytidine deaminase 
APOBEC-1 lowers lipoprotein(a) in transgenic mice and induces apolipoprotein B 

10 mRNA editing in rabbits," Hum. Gene Ther. 7:39-49 (1996); Kozarsky et al., "Hepatic 
expression of the catalytic subunit of the apolipoprotein B mRNA editing enzyme 
ameliorates hypercholesterolemia in LDL receptor-deficient rabbits," Hum. Gene Ther. 
7:943-957 (1996); Farese et al., "Phenotypic analysis of mice expressing exclusively 
apolipoprotein B48 or apolipoprotein B100," Proc. Natl. Acad. Sci. USA 93:6393- 

15 6398 (1996); Qian et al., "Low expression of the apolipoprotein B mRNA editing 
transgene in mice reduces LDL but does not cause liver dysplasia or tumors," 
Arteriosc. Thromb. Vase. Biol. 18:1013-1020 (1998); Wu et al., "Normal perinatal rise 
in serum cholesterol is inhibited by hepatic delivery of adenoviral vector expressing 
apolipoprotein B mRNA editing enzyme in rabbits," J. Surg. Res. 85:148-157 (1999), 

20 each of which is hereby incorporated by reference in its entirety). Interestingly, 
apobec-1 gene transfer into apobec-1 gene knockout mice restored editing and 
reduced serum LDL levels (Nakamuta et al., "Complete phenotypic characterization of 
the apobec-1 knockout mice with a wild-type genetic background and a human 
apolipoprotein B transgenic background, and restoration of apolipoprotein B mRNA 

25 editing by somatic gene transfer of Apobec-1," J. Biol. Chem. 271:25981-25988 

(1996), which is hereby incorporated by reference in its entirety), demonstrating that 
APOBEC-1 has therapeutic potential in livers with no prior editing activity. The 
induction of hepatic editing of apolipoprotein B mRNA in apobec-1 transgenic rabbits 
with an LDL receptor deficiency also ameliorated hypercholesterolemia (Kozarsky et 

30 al., "Hepatic expression of the catalytic subunit of the apolipoprotein B mRNA editing 
enzyme ameliorates hypercholesterolemia in LDL receptor-deficient rabbits," Hum. 
Gene Ther. 7:943-957 (1996), which is hereby incorporated by reference in its 
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entirety). Taken together, these studies suggested that apolipoprotein B mRNA 
editing could be safely targeted as a mechanism for reducing serum LDL and the risk 
of atherogenic diseases. 

However, controlling a low level of apobec-1 expression using gene 
5 therapy is difficult and, quite often, unpredictable. For all of these reasons, despite the 
limited success of gene therapy approaches, gene therapy using apobec-1 does not 
appear to be a promising avenue which can be pursued for preventative or therapeutic 
control over atherogenic disease factors. The advantage of protein transduction 
therapy is that the dose can be modulated relative to the desired response and that the 

10 effect on editing can be terminated by withdrawing therapy. 

The PTD should allow protein to enter all cells of the body, even if the 
protein is delivery intravenously (Schwarze et al., "In vivo protein transduction: 
delivery of a biologically active protein into the mouse," Science 285: 1 569-1 572 
(1999), which is hereby incorporated by reference in its entirety). Ideally the liver 

1 5 should be specifically targeted with TAT-rAPOBEC-CMPK and an intraperitoneal 
injection can be utilized to accomplish a first pass clearance, transducing most of the 
protein into hepatocytes. Even though APOBEC-1 is not widely expressed in tissues 
(Teng et al., "Molecular cloning of an apo B messenger RNA editing protein," Science 
260: 181 16-1819 (1993), which is hereby incorporated by reference in its entirety), its 

20 generalized expression in transgenic animals did not induce pathology (Teng et al., 
"Adenovirus-mediated gene transfer of rat apolipoprotein B mRNA editing protein in 
mice virtually eliminates apolipoprotein B-100 and normal low density lipoprotein 
production," J. Biol. Chem. 269:29395-29404 (1994); Hughs et al., "Gene transfer of 
cytidine deaminase APOBEC-1 lowers lipoprotein(a) in transgenic mice and induces 

25 apolipoprotein B mRNA editing in rabbits," Hum. Gene Ther. 7:39-49 (1996); 

Kozarsky et al., "Hepatic expression of the catalytic subunit of the apolipoprotein B 
mRNA editing enzyme ameliorates hypercholesterolemia in LDL receptor-deficient 
rabbits," Hum. Gene Ther. 7:943-957 (1996); Farese et al., "Phenotypic analysis of 
mice expressing exclusively apolipoprotein B48 or apolipoprotein B100," Proc. Natl. 

30 Acad. Sci. USA 93:6393-6398 (1996); Qian et al., "Low expression of the 

apolipoprotein B mRNA editing transgene in mice reduces LDL but does not cause 
liver dysplasia or tumors," Arteriosc. Thromb. Vase. Biol. 1 8: 1 0 1 3- 1 020 ( 1 998); Wu 



WO 02/068676 



PCT/US02/05824 



-56- 

et al., <f Normal perinatal rise in serum cholesterol is inhibited by hepatic delivery of 
adenoviral vector expressing apolipoprotein B mRNA editing enzyme in rabbits," 1 
Surg. Res. 85:148-157 (1999), each of which is hereby incorporated by reference in its 
entirety). 

5 Uptake of TAT-rAPOBEC-CMPK or TAT-hAPOBEC-CMPK is 

unlikely to induce any side effects. Aside from one study suggesting that 
overexpression of APOBEC-1 in liver can lead to editing of mRNAs other than 
apolipoprotein B (Yamanaka et al., "A novel translational repressor mRNA is edited 
extensively in livers containing tumors caused by the transgene expression of the apoB 

10 mRNA editing enzyme," Genes & Dev. 1 1:321-333 (1997), which is hereby 

incorporated by reference in its entirety) no other mRNA substrates for APOBEC-1 
have been found (Skuse et al, "Neurofibromatosis type I mRNA undergoes base- 
modification RNA editing," Nucleic Acids Res. 24:478-486 (1996); Sowden et al., 
"Apolipoprotein B RNA Sequence 3' of the mooring sequence and cellular sources of 

1 5 auxiliary factors determine the location and extent of promiscuous editing," Nucleic 
Acids Res. 26: 1644-1652 (1998), each of which is hereby incorporated by reference in 
its entirety). Furthermore, apobec-1 gene knock out studies have shown that there 
were no other editing enzymes capable of editing apolipoprotein B mRNA and that 
APOBEC-1 was not required for life (Hirano et al., "Targeted disruption of the mouse 

20 apobec-1 gene abolishes apolipoprotein B mRNA editing and eliminates apolipoprotein 
B48," J. Biol. Chem. 271:9887-9890 (1996); Nakamuta et al, "Complete phenotypic 
characterization of the apobec-1 knockout mice with a wild- type genetic background 
and a human apolipoprotein B transgenic background, and restoration of 
apolipoprotein B mRNA editing by somatic gene transfer of Apobec-1," J. Biol. Chem. 

25 271:25981-25988 (1996), each of which is hereby incorporated by reference in its 
entirety). Taken together the data suggest that mRNA editing by APOBEC is self- 
limited due to its specificity for apolipoprotein B mRNA and, therefore, neither TAT- 
rAPOBEC-CMPK nor TAT-hAPOBEC-CMPK is likely to have effects in tissues other 
than those which express apolipoprotein B mRNA and auxiliary proteins. 

30 Current cholesterol-lowering therapies target circulating cholesterol at 

the level of enhanced elimination or reduced production. A sector of the population 
remains at risk for atherosclerosis due to side effects from current therapies in some of 
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these patients and the inability of others with defects in apolipoprotein B and/or the 
LDL receptor mediated uptake pathway to completely benefit from conventional 
cholesterol lowering therapies. Hypercholesterolemia is an early onset disease yet the 
restricted usage of conventional therapies among children due to the potential of 
5 interfering with pubertal development has not been resolved. Protein based therapies 
such as insulin or growth hormone have been extensively used among children to treat 
Type I diabetes or pituitary dwarfism, respectively. To the patient or the parent of the 
patient, the reversible nature of protein based therapy may be more appealing than 
gene therapy. To this end, the above results illustrate an alternative to conventional or 
10 gene therapy approaches for reducing the risk of atherosclerosis in the sectors of 
population at risk. 

Although preferred embodiments have been depicted and described in 
detail herein, it will be apparent to those skilled in the relevant art that various 
1 5 modifications, additions, substitutions, and the like can be made without departing 
from the spirit of the invention and these are therefore considered to be within the 
scope of the invention as defined in the claims which follow. 
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What Is Claimed: 

1 . A chimeric protein comprising: 

a first polypeptide comprising a protein transduction domain; 

5 and 

a second polypeptide comprising APOBEC- 1 or a fragment 
thereof which can edit mRNA encoding apolipoprotein B. 

2. The chimeric protein according to claim 1 wherein the protein 
10 transduction domain is an HTV TAT protein transduction domain. 

3. The chimeric protein according to claim 2, wherein the HTV 
TAT protein transduction domain comprises an amino acid sequence of SEQ ID No: 9. 

15 4. The chimeric protein according to claim 1 wherein the 

APOBEC- 1 or fragment thereof comprises an amino acid sequence of SEQ ID No: 11, 
SEQ ID No: 13, or SEQ ID No: 15, or fragments thereof 

5. The chimeric protein according to claim 1 further comprising: 
20 a third polypeptide comprising a cytoplasmic localization 

protein or a fragment thereof which, upon cellular uptake of the chimeric protein, 
enhances localization of the chimeric protein to the cytoplasm. 

6. The chimeric protein according to claim 5 wherein the 

25 cytoplasmic localization protein or fragment thereof is chicken muscle pyruvate kinase 
or a fragment thereof 



30 



7. The chimeric protein according to claim 6 wherein the chicken 
muscle pyruvate kinase or a fragment thereof comprises an amino acid sequence of 
SEQ ID No: 17 or fragments thereof. 
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8. The chimeric protein according to claim 5 wherein, within the 
chimeric protein, the third polypeptide is C-terminal of the second polypeptide. 

9. The chimeric protein according to claim 1 further comprising: 
5 a third polypeptide comprising a plurality of adjacent histidine 

residues. 

10. The chimeric protein according to claim 1 further comprising: 
a third polypeptide comprising a hemagglutinin domain. 

10 

1 1 . The chimeric protein according to claim 1 wherein, within the 
chimeric protein, the first polypeptide is N-terminal of the second polypeptide. 

12. The chimeric protein according to claim 1 , wherein the chimeric 
1 5 protein comprises an amino acid sequence of SEQ ID No: 2 or SEQ ID No: 4. 

13. The chimeric protein according to claim 1, wherein the chimeric 
protein is in isolated form. 

20 14. A composition comprising: 

a pharmaceutical^ acceptable carrier and 
the chimeric protein according to claim 1 . 

1 5. The composition according to claim 1 4, wherein the chimeric 
25 protein is present in an amount which is effective to modify apolipoprotein B mRNA 
editing in liver cells which uptake the chimeric protein. 



30 



16. The composition according to claim 14, wherein the 
composition is in the form of a tablet; capsule, powder, solution, suspension, or 
emulsion. 
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17. A chimeric protein comprising: 

a first polypeptide comprising a protein transduction domain; 

and 

a second polypeptide comprising ACF or a fragment thereof 
5 which can bind to apolipoprotein B mKNA to facilitate editing of the mRNA by 
APOBEC-1. 

1 8. The chimeric protein according to claim 1 7 wherein the protein 
transduction domain is an HTV tat protein transduction domain. 

10 

19. The chimeric protein according to claim 18, wherein the HTV tat 
protein transduction domain comprises an amino acid sequence of SEQ ID No: 9. 

20. The chimeric protein according to claim 17 wherein the ACF or 
15 fragment thereof comprises an amino acid sequence of SEQ ID No: 21 or SEQ ID 

No: 23 or fragments thereof. 

2 1 . The chimeric protein according to claim 17 further comprising: 
a third polypeptide comprising a plurality of adjacent histidine 

20 residues. 

22. The chimeric protein according to claim 17 further comprising: 
a third polypeptide comprising a hemagglutinin domain. 

25 23. The chimeric protein according to claim 1 7 wherein, within the 

chimeric protein, the first polypeptide is N-terminal of the second polypeptide. 

24. The chimeric protein according to claim 1 7 wherein the chimeric 
protein comprises an amino acid sequence of SEQ ID No: 6 or SEQ ID No: 8. 

30 

25. The chimeric protein according to claim 17 wherein the chimeric 
protein is in isolated form. 
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26. A composition comprising: 

a first chimeric protein comprising (i) a first polypeptide 
comprising a protein transduction domain and (ii) a second polypeptide comprising 
5 APOBEC-1 or a fragment thereof which can edit the mRNA encoding apolipoprotein 
B; and 

a second chimeric protein comprising (i) a first polypeptide 
comprising a protein transduction domain and (ii) a second polypeptide comprising 
ACF or a fragment thereof which can bind to apolipoprotein B mRNA to facilitate 
1 0 editing of the mRNA by APOBEC- 1 or the fragment thereof. 

27. The composition according to claim 26 wherein 

the first chimeric protein is present in an amount which is 
effective to modify apolipoprotein B mRNA editing in cells which uptake the first 
1 5 chimeric protein and 

the second chimeric protein is present in an amount which is 
effective to bind apolipoprotein B mRNA and assist the first chimeric protein in 
modifying apolipoprotein B mRNA in cells which uptake the first and second chimeric 
proteins. 

20 

28. The composition according to claim 26 wherein the first 
chimeric protein comprises an amino acid sequence of SEQ ID No: 2 or SEQ ID 
No: 4. 

25 29. The composition according to claim 26 wherein the second 

chimeric protein comprises an amino acid sequence of SEQ ID No: 6 or SEQ ID 

No: 8. 

30. The composition according to claim 26 further comprising: 
30 a pharmaceutical^ acceptable carrier in which the first and 

second chimeric proteins are dispersed. 
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3 1 . The composition according to claim 26 wherein the composition 
is in the form of a tablet, capsule, powder, solution, suspension, or emulsion. 

32. A DNA molecule encoding a chimeric protein according to 

5 claim 1 . 

33. The DNA molecule according to claim 32 comprising a 
nucleotide sequence of SEQ ID No: 1 or SEQ ID No: 3. 

10 34. A DNA construct comprising: 

the DNA molecule according to claim 32; 

a promoter sequence operably connected 5' to the DNA 

molecule; and 

a 3' regulatory sequence operably connected 3' of the DNA 

15 molecule. 

35. An expression vector comprising a DNA molecule according to 

claim 32. 

20 36. A recombinant host cell transformed with a DNA molecule 

according to claim 32. 

37. A DNA molecule encoding a chimeric protein according to 

claim 17. 

25 

38. The DNA molecule according to claim 37 comprising a 
nucleotide sequence of SEQ ID No: 5 or SEQ ID No: 7. 
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39. A DNA construct comprising: 

the DNA molecule according to claim 37; 

a promoter sequence operably connected 5' to the DNA 

molecule; and 

5 a 3' regulatory sequence operably connected 3' of the DNA 

molecule. 

40. An expression vector comprising a DNA molecule according to 

claim 37. 

10 

41 . A recombinant host cell transformed with a DNA molecule 
according to claim 37. 

42. A delivery device comprising a chimeric protein according to 

1 5 claim 1 . 

43. The delivery device according to claim 42, wherein the delivery 
device is in the form of a liposome, a niosome, a transdermal patch, an implant, or a 
syringe. 

20 

44. A delivery device comprising a composition according to 

claim 14. 

45. The delivery device according to claim 44, wherein the delivery 
25 device is in the form of a liposome, a niosome, a transdermal patch, an implant, or a 

syringe. 



30 



claim 26. 



46. 



A delivery device comprising a composition according to 
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47. The delivery device according to claim 46, wherein the delivery 
device is in the form of a liposome, a niosome, a transdermal patch, an implant, or a 
syringe. 

48. A method of modifying apolipoprotein B mRNA editing in vivo 

5 comprising: 

contacting apolipoprotein B mRNA in a cell with a chimeric 
protein according to claim 1 under conditions effective to increase the concentration of 
apolipoprotein B48 which is secreted by the cell as compared to the concentration of 
apolipoprotein B100 which is secreted by the cell, relative to an untreated cell. 

10 

49. The method according to claim 48 wherein the cell is a liver cell. 

50. The method according to claim 48 wherein the cell is present in 

a mammal. 

15 

51. 

said contacting: 
effective to induce 

20 

52. The method according to claim 48 wherein the chimeric protein 
comprises an amino acid sequence of SEQ ID No: 2 or SEQ ID No: 4. 

53. The method according to claim 48 wherein said contacting 
2 5 further comprises : 

contacting the apolipoprotein B mRNA in the cell with a second 
chimeric protein comprising (i) a first polypeptide comprising a protein transduction 
domain and (ii) a second polypeptide comprising ACF or a fragment thereof which can 
bind to apolipoprotein B mRNA. 

30 

54. The method according to claim 53 wherein the second chimeric 
protein comprises an amino acid sequence of SEQ ID No: 6 or SEQ ID No: 8. 



The method according to claim 48 further comprising prior to 

exposing the cell to the chimeric protein under conditions 
cellular uptake of the chimeric protein. 
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55. A method of reducing serum LDL levels comprising: 
delivering into one or more cells of a patient, without genetically 

modifying the cells, an amount of a protein comprising APOBEC-1 or a fragment 
5 thereof which can edit mRNA encoding apolipoprotein B, which amount is effective to 
increase the concentration of VLDL-apolipoprotein B48 that is secreted by the one or 
more cells into serum and, consequently, reduce the serum concentration of LDL. 

56. The method according to claim 55 wherein the one or more 
10 cells are liver cells, intestinal cells, or a combination thereof. 

57. The method according to claim 55 wherein the patient is a 

mammal. 

15 58. The method according to claim 57 wherein the mammal is a 

human. 

59. The method according to claim 55 wherein said delivering 

comprises: 

20 exposing the one or more cells to the protein under conditions 
effective to cause cellular uptake of the protein. 

60. The method according to claim 59 wherein the protein is a 
chimeric protein which further comprises a polypeptide comprising a protein 

25 transduction domain. 

61 . The method according to claim 60 wherein the chimeric protein 
comprises an amino acid sequence of SEQ ID No: 2 or SEQ ID No: 4. 



30 62. The method according to claim 59 wherein the protein is present 

in a liposome or niosome which is taken up by liver cells. 
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63. The method according to claim 55 wherein said delivering 
further comprises: 

simultaneously delivering into the one or more cells of the 
5 patient, also without genetically modifying the cells, an amount of a second protein 
comprising ACF or a fragment thereof which can bind to apolipoprotein B mRNA. 

64. The method according to claim 63 wherein said simultaneously 
delivering comprises: 

1 0 exposing the one or more cells to the second protein under 

conditions effective to cause cellular uptake of the second protein. 

65. The method according to claim 64 wherein the second protein is 
a chimeric protein which further comprises a polypeptide comprising a protein 

1 5 transduction domain. 

66. The method according to claim 65 wherein the chimeric protein 
comprises an amino acid sequence of SEQ ID No: 6 or SEQ ID No: 8. 

20 67. The method according to claim 55 further comprising: 

repeating said delivering following a delay. 

68. The method according to claim 67 wherein the delay is from 
about 1 to about 7 days. 

25 

69. A method of treating or preventing an atherogenic disease or 
disorder comprising: 

administering to a patient an effective amount of a protein 
comprising APOBEC-1 or a fragment thereof which can edit mRNA encoding 
30 apolipoprotein B, wherein upon said administering the protein is taken up by one or 
more cells of the patient that can synthesize and secrete VLDL-apolipoprotein B under 
conditions which are effective to increase the concentration of VLDL-apolipoprotein 
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B48 that is secreted by the one or more cells into serum, whereby rapid clearing of 
VLDL-apoIipoprotein B48 from serum decreases the serum concentration of LDL to 
treat or prevent the atherogenic disease or disorder. 

5 70. The method according to claim 69 wherein the patient is a 

mammal. 

71 . The method according to claim 70 wherein the mammal is a 

human. 

10 

72. The method according to claim 69 wherein said administering is 
carried out orally, topically, transdermal^, parenterally, subcutaneously, intravenously, 
intramuscularly, intraperitoneally, by intracavitary or intravesical instillation, 
intraocularly, intraarterially, intralesionally, by application to mucous membranes, or by 

15 implantation. 

73. The method according to claim 69 wherein the protein is a 
chimeric protein which further comprises a protein transduction domain. 

20 74. The method according to claim 73 wherein the chimeric protein 

comprises an amino acid sequence of SEQ ID No: 2 or SEQ ID No: 4. 

75. The method according to claim 69 wherein the polypeptide is 
present in a liposome or niosome which is taken up by liver cells. 

25 

76. The method according to claim 69 wherein said administering 
further comprises: 

second administering to the patient an effective amount of a 
second protein comprising ACF or a fragment thereof which can bind to 
30 apolipoprotein B mRNA. 
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77. The method according to claim 76 wherein said second 
administering is carried out simultaneously. 

78. The method according to claim 76 wherein the second 

5 polypeptide is a chimeric protein which further comprises a protein transduction 
domain. 

79. The method according to claim 78 wherein the chimeric protein 
comprises an amino acid sequence of SEQ ID No: 6 or SEQ ID No: 8. 

10 

80. The method according to claim 69 further comprising: 
repeating said administering following a delay. 

81 . The method according to claim 80 wherein the delay is from 
15 about 1 to about 7 days. 

82. A liposome or niosome which is targeted for uptake by a liver 
cell, the liposome or niosome containing (i) APOBEC-1 or a fragment thereof which is 
effective to edit apolipoprotein B mRNA, (ii) ACF or a fragment thereof which is 

20 effective to bind apolipoprotein B mRNA, or (iii) a combination thereof 

83. The liposome or niosome according to claim 82 in the form of a 
liposome comprising asialofetuin incorporated into a lipid bilayer. 

25 84. The liposome or niosome according to claim 82, in the form of a 

niosome comprising doxorubicin with a polyoxyethylene surface. 

85. The liposome or niosone according to claim 82, wherein the 
liposome or niosome contains APOBEC-1 or a fragment thereof which is effective to 
30 edit apolipoprotein B mRNA. 
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86. The liposome or niosome according to claim 82, wherein the 
liposome or niosome contains ACF or a fragment thereof which is effective to bind 
apolipoprotein B mRNA. 

5 87. The liposome or niosome according to claim 82, wherein the 

liposome or niosome contains a combination of APOBEC-1 or a fragment thereof 
which is effective to edit apolipoprotein B mKNA and ACF or a fragment thereof 
which is effective to bind apolipoprotein B mRNA. 



10 



88. A composition comprising: 

a pharmaceutical^ acceptable carrier and the liposome or 
niosome according to claim 82. 
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6-His 



Figure 1A 



atggctagca 
aAGAAAAAAA 
TGCCCGACTA 
cccactctga 
ccccagagaa 
gcatgagccg 
gtggaagtta 
atccatcagc 
aatgctccca 
ctagtgatct 
gcaaggtctc 
gagcatcaga 
ggggatgaag 
cgcactggag 
tttcaagaag 
aactgccatt 
gatacatcct 
CCTTTCTGGA 
GCCAGAAACA 
GGACAAGCTG 
ACTTCTCGCA 
CGAGAGGCCA 
GGCTATTGCA 
AGGGAAGTGG 
GTGACGCTGG 
GGTGGACTAC 
ATGTGGATGA 
TTTGTCATGA 
AGTGAACCTC 
ACATTCAGGA 



tgactggtgg 
AGAAGACAAA 
CGCCGATATC 
ggagaagaat 
cttcgtaaag 
gaagatctgg 
attttataaa 
tgctccatca 
ggctattaga 
acgtagctcg 
agggaccttg 
gtattatcac 
ctcactggcc 
ctgcactgca 
atggcaaaat 
accaaacgat 
tctgtggctt 
GCACATGTGC 
CCGGCATCAT 
AAGGAAATGA 
CGGCACCCAC 
CAGAGAGCTT 
CTGGACACCA 
CACAGCAGAG 
ACAATGCCTT 
AAGAACCTCA 
CGGTCTCATT 
CTGAGGTTGA 
CCAGGTGCTG 
CCTGAAATTT 



acagcaaatg 
GAAGAAGAGG 
acttctgaga 
cgaaccctgg 
aggcctgtct 
cgaagctcag 
aaaatttacg 
cctggttctt 
gagtttctga 
gcttttttgg 
ttaacagtgg 
tgctggagga 
acaataccca 
taattctaag 
catcttacat 
tccgccacac 
ggagagaatt 
CGCCTGGACA 
CTGCACCATC 
TTAAATCTGG 
GAGTATCATG 
TGCCTCTGAC 
AGGGACCTGA 
GTGGAGCTCA 
CATGGAGAAC 
TCAAAGTTAT 
TCCTTGCTGG 
GAACGGTGGC 
CGGTCGACCT 
GGCGTGGAGC 



ggtcgggatc 
CtctagaTAC 
aaggtccttc 
gagtttgacg 
gctctacgaa 
gcaaaaacac 
tcagaaagag 
gtcctggagt 
gtcggcaccc 
cacatggatc 
agtaactatt 
attttgtcaa 
cctctgtgga 
tcttccaccc 
ttttcagact 
atccttttag 
CCACGCTGCC 
TCGACTCCGA 
GGCCCAGCCT 
AATGAATGTT 
AGGGCACAAT 
CCGATCACCT 
AATCCGAACT 
AGAAGGGCGC 
TGCGATGAGA 
AGATGTGGGC 
TTAAGGAGAA 
ATGCTTGGTA 
GCCTGCAGTC 
AGAATGTGGA 



cgggatatgg 
CCCTACGACG 
aaccggtgac 
tcttctatga 
atcaagtggg 
caccaatcac 
attttcaccc 
ccctgctggg 
tggtgtgact 
aacaaaatcg 
cagattatga 
ctacccacct 
tgatgttgta 
tgtttaaaga 
tcatcttcaa 
ctacagggct 
ATGGCAGACA 
GCCAACCATT 
CCCGCTCTGT 
GCCCGCCTCA 
TAAGAACGTG 
ACAGACCTGT 
GGACTCATCA 
AGCTCTCAAA 
ATGTGCTGTG 
AGCAAAATCT 
AGGCAAGGAC 
GTAAGAAGGG 
TCAGAGAAGG 
CATGGTGTTC 
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GCTTCCTTCA TCCGCAAAGC TGCTGATGTC CATGCTGTCA GGAAGGTGCT 1550 

AGGGGAAAAG GGAAAGCACA TCAAGATTAT CAGCAAGATT GAGAATCACG 1600 

AGGGTGTGCG CAGGTTTGAT GAGATCATGG AGGCCAGCGA TGGCATTATG 1650 

GTGGCCCGTG GTGACCTGGG TATTGAGATC CCTGCTGAAA AAGTCTTCCT 1700 

CGCACAGAAG ATGATGATTG GGCGCTGCAA CAGGGCTGGC AAACCCATCA 1750 

TTTGTGCCAC TCAGATGTTG GAAAGCATGA TCAAGAAACC TCGCCCGACC 1800 

CGCGCTGAGG GCAGTGATGT TGCCAATGCA GTTCTGGATG GAGCAGACTG 1850 

CATCATGCTG TCTGGGGAGA CCGCCAAGGG AGACTACCCA CTGGAGGCTG 1900 

TGCGCATGCA GCACGCTATT. GCTCGTGAGG CTGAGGCCGC AATGTTCCAT 1950 

CGTCAGCAGT TTGAAGAAAT CTTACGCCAC AGTGTACACC ACAGGGAGCC 2000 

TGCTGATGCC ATGGCAGCAG GCGCGGTGGA GGCCTCCTTT AAGTGCTTAG 2050 

CAGCAGCTCT GATAGTTATG ACCGAGTCTG GCAGGTCTGC ACACCTGGTG 2100 

TCCCGGTACC GCCCGCGGGC TCCCATCATC GCCGTCACCC GCAATGACCA 2150 

AACAGCACGC CAGGCACACC TGTACCGCGG CGTCTTCCCC GTGCTGTGCA 2200 

AGCAGCCGGC CCACGATGCC TGGGCAGAGG ATGTGGATCT CCGTGTGAAC 2250 

CTGGGCATGA ATGTCGGCAA AGCCCGTGGA TTCTTCAAGA CCGGGGACCT 2300 

GGTGATCGTG CTGACGGGCT GGCGCCCCGG CTCCGGCTAC ACCAACACCA 2350 

TGCGGGTGGT GCCCGTGCCA gcggccgcac tcgagcacca ccaccaccac 2400 

cactga 2406 

Figure 1C 



MASMTGGQQM GRDPGYGRKK RKQRRRGSRY PYDVPDYADI TSEKGPSTGD 50 

PTLRRRIEPW EFDVFYDPRE LRKEACLLYE IKWGMSRKIW RSSGKNTTNH 100 

VEVNFIKKFT SERDFHPSIS CSITWFLSWS PCWECSQAIR EFLSRHPGVT 150 

LVIYVARLFW HMDQQNRQGL RDLVNSGVTI QIMRASEYYH CWRNFVNYPP 200 

GDEAHWPQYP PLWMMLYALE LHCIILSLPP CLKISRRWQN HLTFFRLHLQ 250 

NCHYQTIPPH ILLATGLIHP SVAWR EFHAA MADTFLEHMC RLDIDSEPTI 300 

ARNTGIICTI GPASRSVDKL KEMIKSGMNV ARLNFSHGTH EYHEGTIKNV 350 

REATESFASD PITYRPVAIA LDTKGPEIRT GLIKGSGTAE VELKKGAALK 400 

VTLDNAFMEN CDENVLWVDY KNLIKVIDVG SKIYVDDGLI SLLVKEKGKD 450 

FVMTEVENGG MLGSKKGVNL PGAAVDLPAV SEKDIQDLKF GVEQNVDMVF 500 

ASFIRKAADV HAVRKVLGEK GKHIKIISKI ENHEGVRRFD EIMEASDGIM 550 

VARGDLGIEI PAEKVFLAQK MMIGRCNRAG KPIICATQML ESMIKKPRPT 600 

RAEGSDVANA VLDGADCIML SGETAKGDYP LEAVRMQHAI AREAEAAMFH 650 

RQQFEEILRH SVHHREPADA MAAGAVEASF KCLAAALIVM TESGRSAHLV 700 

SRYRPRAPII AVTRNDQTAR QAHLYRGVFP VLCKQPAHDA WAEDVDLRVN 750 

LGMNVGKARG FFKTGDLVIV LTGWRPGSGY TNTMRWPVP AAALEHHHHH 800 

H 801 
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atggctagca 
aAGAAAAAAA 
TGCCCGACTA 
cccactctga 
cccccgggaa 
gaggaaggca 
gttgaagtca 
aaacaccaga 
agtgctccag 
ctgtttattt 
gcaaggactc 
cggagcaaga 
tcgaatgaag 
cgtactggaa 
ttttaagaag 
agctgccatt 
gaaagaattc 
GCCTGGACAT 
TGCACCATCG 
TAAATCTGGA 
AGTATCATGA 
GCCTCTGACC 
GGGACCTGAA 
TGGAGCTCAA 
ATGGAGAACT 
CAAAGTTATA 
CCTTGCTGGT 
AACGGTGGCA 
GGTCGACCTG 
GCGTGGAGCA 



tgactggtgg 
AGAAGACAAA 
CGCCGATATC 
ggagaagaat 
cttcggaaag 
cagcatctgg 
atttcataga 
tgctccatta 
ggccattaca 
atatagcacg 
agggacctta 
gtctggctac 
ctcattggcc 
ctctactgca 
aaaacaacct 
accaaaggct 
CACGCTGCCA 
CGACTCCGAG 
GCCCAGCCTC 
ATGAATGTTG 
GGGCACAATT 
CGATCACCTA 
ATCCGAACTG 
GAAGGGCGCA 
GCGATGAGAA 
GATGTGGGCA 
TAAGGAGAAA 
TGCTTGGTAG 
CCTGCAGTCT 
GAATGTGGAC 



acagcaaatg 
GAAGAAGAGG 
agttccgaga 
tgagccccac 
agacctgtct 
cgacacacga 
aaaatttact 
cctggttcct 
gaatttttga 
gctttatcac 
ttagcagcgg 
tgctggagga 
aaggtacccc 
tcattttagg 
caactcacgt 
accaccccac 
TGGCAGACAC 
CCAACCATTG 
CCGCTCTGTG 
CCCGCCTCAA 
AAGAACGTGC 
CAGACCTGTG 
GACTCATCAA 
GCTCTCAAAG 
TGTGCTGTGG 
GCAAAATCTA 
GGCAAGGACT 
TAAGAAGGGA 
CAGAGAAGGA 
ATGGTGTTCG 



ggtcgggatc 
CtctagaTAC 
caggccctgt 
gagtttgaag 
gctgtatgag 
gccaaaacac 
acagaaagat 
gtcctggagt 
gccgataccc 
cacgcagatc 
tgttactatc 
attttgtcaa 
catctgtggg 
acttccaccc 
ttttcacgat 
atcctgtggg 
CTTTCTGGAG 
CCAGAAACAC 
GACAAGCTGA 
CTTCTCGCAC 
GAGAGGCCAC 
GCTATTGCAC 
GGGAAGTGGC 
TGACGCTGGA 
GTGGACTACA 
TGTGGATGAC 
TTGTCATGAC 
GTGAACCTCC 
CATTCAGGAC 
CTTCCTTCAT 



cgggatatgg 50 
CCCTACGACG 100 
agctgttgat 150 
tcttctttga 200 
atcaactggg 250 
caacaaacac 300 
acttttgtcc 350 
ccctgtgggg 4 00 
ccatgtaact 4 50 
ctcgaaatcg 500 
cagatcatga 550 
ctactcccct 600 
tgaggctgta 650 
tgtttaaata 700 
tgctcttcaa 750 
ccacagggtt 800 
CACATGTGCC 850 
CGGCATCATC 900 
AGGAAATGAT 950 
GGCACCCACG 1000 
AGAGAGCTTT 1050 
TGGACACCAA 1100 
ACAGCAGAGG 1150 
CAATGCCTTC 1200 
AGAACCTCAT 1250 
GGTCTCATTT 1300 
TGAGGTTGAG 1350 
CAGGTGCTGC 1400 
CTGAAATTTG 1450 
CCGCAAAGCT 1500 
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GCTGATGTCC ATGCTGTCAG GAAGGTGCTA GGGGAAAAGG GAAAGCACAT 
CAAGATTATC AGCAAGATTG AGAATCACGA GGGTGTGCGC AGGTTTGATG 
AGATCATGGA GGCCAGCGAT GGCATTATGG TGGCCCGTGG TGACCTGGGT 
ATTGAGATCC CTGCTGAAAA AGTCTTCCTC GCACAGAAGA TGATGATTGG 
GCGCTGCAAC AGGGCTGGCA AACCCATCAT TTGTGCCACT CAGATGTTGG 
AAAGCATGAT CAAGAAACCT CGCCCGACCC GCGCTGAGGG CAGTGATGTT 
GCCAATGCAG TTCTGGATGG AGCAGACTGC ATCATGCTGT CTGGGGAGAC 
CGCCAAGGGA GACTACCCAC TGGAGGCTGT GCGCATGCAG CACGCTATTG 
CTCGTGAGGC TGAGGCCGCA ATGTTCCATC GTCAGCAGTT TGAAGAAATC 
TTACGCCACA GTGTACACCA CAGGGAGCCT GCTGATGCCA TGGCAGCAGG 
CGCGGTGGAG GCCTCCTTTA AGTGCTTAGC AGCAGCTCTG ATAGTTATGA 
CCGAGTCTGG CAGGTCTGCA CACCTGGTGT CCCGGTACCG CCCGCGGGCT 
CCCATCATCG CCGTCACCCG CAATGACCAA ACAGCACGCC AGGCACACCT 
GTACCGCGGC GTCTTCCCCG TGCTGTGCAA GCAGCCGGCC CACGATGCCT 
GGGCAGAGGA TGTGGATCTC CGTGTGAACC TGGGCATGAA TGTCGGCAAA 
GCCCGTGGAT TCTTCAAGAC CGGGGACCTG GTGATCGTGC TGACGGGCTG 
GCGCCCCGGC TCCGGCTACA CCAACACCAT GCGGGTGGTG CCCGTGCCAg 
cggccgcact cgagcaccac caccaccacc actga 

Figure 2C 



MASMTGGQQM GRDPGYGRKK RRQRRRGSRY P YD VP DYAD I SSETGPVAVD 50 

PTLRRRIEPH EFEVFFDPRE LRKETCLLYE INWGGRHSIW RHTSQNTNKH 100 

VEVNFIEKFT TERYFCPNTR CSITWFLSWS PCGECSRAIT EFLSRYPHVT 150 

LFIYIARLYH HADPRNRQGL RDLISSGVTI QIMTEQESGY CWRNFVNYSP 200 

SNEAHWPRYP HLWVRLYVLE LYCIILGLPP CLNILRRKQP QLTFFTIALQ 250 

SCHYQRLPPH ILWATGLK EF HAAMADTFLE HMCRLDIDSE PTIARNTGII 300 

CTIGPASRSV DKLKEMIKSG MNVARLNFSH GTHEYHEGTI KNVREATESF 350 

ASDPITYRPV AIALDTKGPE IRTGLIKGSG TAEVELKKGA ALKVTLDNAF 400 

MENCDENVLW VDYKNLIKVI DVGSKIYVDD GLISLLVKEK GKDFVMTEVE 450 

NGGMLGSKKG VNLPGAAVDL PAVSEKDIQD LKFGVEQNVD MVFASFIRKA 500 

ADVHAVRKVL GEKGKHIKII SKIENHEGVR RFDEIMEASD GIMVARGDLG 550 

IEIPAEKVFL AQKMMIGRCN RAGKPIICAT QMLESMIKKP RPTRAEGSDV 600 

ANAVLDGADC IMLSGETAKG DYPLEAVRMQ HAIAREAEAA MFHRQQFEEI 650 

LRHSVHHREP ADAMAAGAVE ASFKCLAAAL IVMTESGRSA HLVSRYRPRA 700 

PIIAVTRNDQ TARQAHLYRG VFPVLCKQPA HDAWAEDVDL RVNLGMNVGK 750 

ARGFFKTGDL VIVLTGWRPG SGYTNTMRW PVPAAALEHH HHHH ~ ~ 794 

Figure 2D 
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Figure 3 A 



MASMTGGQQM GRDPGYGRKK RRQRRRGSRY PYDVPDYADI MESNHKSGDG 50 

LSGTQKEAAL RALVQRTGYS LVQENGQRKY GGPPPGWDAA PPERGCEIFI 100 

GKLPRDLFED ELIPLCEKIG KIYEMRMMMD FNGNNRGYAF VTFSNKVEAK 150 

NAIKQLNNYE IRNGRLLGVC ASVDNCRLFV GGIPKTKKRE EILSEMKKVT 200 

EGWDVIVYP SAADKTKNRG FAFVEYESHR TAAMARRKLL PGRIQLWGHG 250 

IAVDWAEPEV EVDEDTMSSV KILYVRNLML STSEEMIEKE FNNIKPGAVE 300 

RVKKIRDYAF VHFSNRKDAV EAMKALNGKV LDGSPIEVTL AKPVDKDSYV 350 

RYTRGTGGRG TMLQGEYTYS LGQVYDPTTT YLGAPVFYAP QTYAAIPSLH 400 

FPATKGHLSN RAIIRAPSVR GAAGVRGLGG RGYLAYTGLG RGYQVKGDKR 450 

EDKLYDILPG MELTPMNPVT LKPQGIKLAP QILEEICQKN NWGQPVYQLH 500 

SAIGQDQRQL FLYKITIPAL ASQNPAIHPF TPPKLSAFVD EAKTYAAEYT 550 

LQTLGIPTDG GDGTMATAAA AATAFPGYAV PNATAPVSAA QLKQAVTLGQ 600 

DLAAYTTYEV YPTFAVTARG DGYGTFAAAL EHHHHHH 637 
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ATGGCTAGCA TGACTGGTGG ACAGCAAATG GGTCGGGATC CGGGATATGG 50 
AAGAAAAAAA AGAAGACAAA GAAGAAGAGG CTCTAGATAC CCCTACGACG 100 
TGCCCGACTA CGCCGATATC atggaatcaa atcacaaatc cggggatgga 150 
ttgagcggca ctcagaagga agcagccctc cgcgcactgg tccagcgcac 200 
aggatatagc ttggtccagg aaaatggaca aagaaaatat ggtggccctc 250 
cacctggttg ggatgctgca ccccctgaaa ggggctgtga aatttttatt 300 
ggaaaacttc cccgagacct ttttgaggat gagcttatac cattatgtga 3 50 
aaaaatcggt aaaatttatg aaatgagaat gatgatggat tttaatggca 400 
acaatagagg atatgcattt gtaacatttt caaataaagt ggaagccaag 4 50 
aatgcaatca agcaacttaa taattatgaa attagaaatg ggcgcctctt 500 
aggggtttgt gccagtgtgg acaactgccg attatttgtt gggggcatcc 550 
caaaaaccaa aaagagagaa gaaatcttat cggagatgaa aaaggttact 600 
gaaggtgttg tcgatgtcat cgtctaccca agcgctgcag ataaaaccaa 650 
aaaccgaggc tttgccttcg tggagtatga gagtcatcga acagctgcca 700 
tggcgaggag gaaactgcta ccaggaagaa ttcagttatg gggacatggt 750 
attgcagtag actgggcaga gccagaagta gaagttgatg aagatacaat 800 
gtcttcagtg aaaatcctat atgtaagaaa tcttatgctg tctacctctg 850 
aagagatgat tgaaaaggaa ttcaacaata tcaaaccagg tgctgtggag 900 
agggtgaaga aaattcgaga ctatgctttt gtgcacttca gtaaccgaaa 950 
agatgcagtt gaggctatga aagctttaaa tggcaaggtg ctggatggtt 1000 
cccccattga agtcacccta gcaaaaccag tggacaagga cagttatgtt 1050 
aggtataccc gaggcacagg tggaaggggc accatgctgc aaggagagta 1100 
tacctactct ttgggccaag tttatgatcc caccacaacc taccttggag 1150 
ctcctgtctt ctatgccccc cagacctatg cagcaattcc cagtcttcat 1200 
ttcccagcca ccaaaggaca tctcagcaac agagccatta tccgagcccc 1250 
ttctgttaga ggggctgcgg gagtgagagg actgggcggc cgtggctatt 1300 
tggcatacac aggcctgggt cgaggatacc aggtcaaagg agacaaaaga 1350 
gaagacaaac tctatgacat tttacctggg atggagctca ccccaatgaa 14 00 
tcctgtcaca ttaaaacccc aaggaattaa actcgctccc cagatattag 14 50 
aagagatttg tcagaaaaat aactggggac agccagtgta ccagctgcac 1500 
tctgctattg gacaagacca aagacagcta ttcttgtaca aaataactat 1550 
tcctgctcta gccagccaga atcctgcaat ccaccctttc acacctccaa 1600 
agctgagtgc ctttgtggat gaagcaaaga cgtatgcagc cgaatacacc 1650 
ctgcagaccc tgggcatccc cactgatgga ggcgatggca ccatggctac 1700 
tgctgctgct gctgctactg ctttcccagg atatgctgtc cctaatgcaa 1750 
ctgcacccgt gtctgcagcc cagctcaagc aagcggtaac ccttggacaa 1800 
gacttagcag catatacaac ctatgaggtc tacccaactt ttgcagtgac 1850 
tgcccgaggg gatggatatg gcaccttcGC GGCCGCACTC GAGCACCACC 1900 
ACCACCACCA CTGA 1914 
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Figure 4 A 



MESNHKSGDG 
PPERGCEIFI 



GRDPGYGRKK RRQRRRGSRY PYDVPDYADI 
RALVQRTGYS LVQENGQRKY GGPPPGWDAA 
ELIPLCEKIG KIYEMRMMMD FNGNNRGYAF VTFSNKVEAK 
IRNGRLLGVC ASVDNCRLFV GGIPKTKKRE EILSEMKKVT 

PGRIQLWGHG 
FNNIKPGAVE 
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LSGTQKEAAL 
GKLPRDLFED 
NAIKQLNNYE 

EGWDVIVYP SAADKTKNRG FAFVEYESHR TAAMARRKLL 
IAVDWAEPEV EVDEDTMSSV KILYVRNLML STSEEMIEKE 
RVKKIRDYAF VHFSNRKDAV EAMKALNGKV LDGSPIEVTL AKPVDKDSYV 
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ATGGCTAGCA TGACTGGTGG ACAGCAAATG 
AAGAAAAAAA AGAAGACAAA GAAGAAGAGG 
TGCCCGACTA CGCCGATATC atggaatcaa 
ttgagcggca ctcagaagga agcagccctc 
aggatatagc ttggtccagg aaaatggaca 
cacctggttg ggatgctgca ccccctgaaa 
ggaaaacttc cccgagacct ttttgaggat 
aaaaatcggt aaaatttatg aaatgagaat 
acaatagagg atatgcattt gtaacatttt 
aatgcaatca agcaacttaa taattatgaa 
aggggtttgt gccagtgtgg acaactgccg 
caaaaaccaa aaagagagaa gaaatcttat 
gaaggtgttg tcgatgtcat cgtctaccca 
aaaccgaggc tttgccttcg tggagtatga 
tggcgaggag gaaactgcta ccaggaagaa 
attgcagtag actgggcaga gccagaagta 
gtcttcagtg aaaatcctat atgtaagaaa 
aagagatgat tgaaaaggaa ttcaacaata 
agggtgaaga aaattcgaga ctatgctttt 
agatgcagtt gaggctatga aagctttaaa 
cccccattga agtcacccta gcaaaaccag 
aggtataccc gaggcacagg tggaaggggc 
tacctactct ttgggccaag tttatgatcc 
ctcctgtctt ctatgccccc cagacctatg 
ttcccagcca ccaaaggaca tctcagcaac 
ttctgttaga ggggctgcgg gagtgagagg 
tggcatacac aggcctgggt cgaggatacc 
gaagacaaac tctatgacat tttacctggg 
tcctgtcaca ttaaaacccc aaggaattaa 
aagagatttg tcagaaaaat aactggggac 
tctgctattg gacaagacca aagacagcta 
tcctgctcta gccagccaga atcctgcaat 
agctgagtgc ctttgtggat gaagcaaaga 
ctgcagaccc tgggcatccc cactgatgga 
tgctgctgct gctgctactg ctttcccagg 
ctgcacccgt gtctgcagcc cagctcaagc 
gacttagcag catatacaac ctatgaggtc 
tgcccgaggg gatggatatg gcaccttcGC 
ACCACCACCA CTGA 



GGTCGGGATC CGGGATATGG 50 

CTCTAGATAC CCCTACGACG 100 

atcacaaatc cggggatgga 150 

cgcgcactgg tccagcgcac 200 

aagaaaatat ggtggccctc 250 

9993Ctgtga aatttttatt 300 

gagcttatac cattatgtga 350 

gatgatggat tttaatggca 4 00 

caaataaagt ggaagccaag 450 

attagaaatg ggcgcctctt 500 

attatttgtt gggggcatcc 550 

cggagatgaa aaaggttact 600 

agcgctgcag ataaaaccaa 650 

gagtcatcga acagctgcca 700 

ttcagttatg gggacatggt 750 

gaagttgatg aagatacaat 800 

tcttatgctg tctacctctg 850 

tcaaaccagg tgctgtggag 900 

gtgcacttca gtaaccgaaa 950 

tggcaaggtg ctggatggtt 1000 

tggacaagga cagttatgtt 1050 

accatgctgc aaggagagta 1100 

caccacaacc taccttggag 1150 

cagcaattcc cagtcttcat 1200 

agagccatta tccgagcccc 1250 

actgggcggc cgtggctatt 1300 

aggtcaaagg agacaaaaga 1350 

atggagctca ccccaatgaa 1400 

actcgctccc cagatattag 14 50 

agccagtgta ccagctgcac 1500 

ttcttgtaca aaataactat 1550 

ccaccctttc acacctccaa 1600 

cgtatgcagc cgaatacacc 1650 

ggcgatggca ccatggctac 1700 

atatgctgtc cctaatgcaa 1750 

aagcggtaac ccttggacaa 1800 

tacccaactt ttgcagtgac 1850 

GGCCGCACTC GAGCACCACC 1900 

1914 
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<212> DNA . 

<213> Artificial Sequence 

<220> .... 

<223> Description of Artificial. Sequence: 
TAT-hAPOBEC-CMPK ... 



<400> 1. . 

atggctagca tgactggtgg. acagcaaatg. ggtcgggatc cgggatatgg aagaaaaaaa 60 
agaagacaaa gaagaagagg. ctctagatac. ccctacgacg tgcccgacta cgccgatatc 120 
acttctgaga aaggtccttc aaccggtgac cccactctga ggagaagaat cgaaccctgg. 180 
gagtttgacg tcttctatga ccccagagaa. cttcgtaaag aggcctgtct gctctacgaa 240. 
atcaagtggg. gcatgagccg gaagatctgg. cgaagctcag gcaaaaacac caccaatcac 300. 
gtggaagtta attttataaa aaaatttacg. tcagaaagag attttcaccc atccatcagc. 360 
tgctccatca cctggttctt. gtcctggagt. ccctgctggg aatgctccca ggctattaga 420 
gagtttctga gtcggcaccc. tggtgtgact ctagtgatct. acgtagctcg gcttttttgg 480 
cacatggatc aacaaaatcg gcaaggtctc agggaccttg. ttaacagtgg. agtaactatt. 540 . 
cagattatga gagcatcaga gtattatcac tgctggagga attttgtcaa ctacccacct 600 . 
ggggatgaag ctcactggcc. acaataccca. cctctgtgga tgatgttgta cgcactggag 660 . 
ctgcactgca taattctaag tcttccaccc tgtttaaaga tttcaagaag atggcaaaat. 720 . 
catcttacat ttttcagact tcatcttcaa aactgccatt accaaacgat tccgccacac 780. . 
atccttttag ctacagggct gatacatcct. tctgtggctt ggagagaatt ccacgctgcc 840 
atggcagaca cctttctgga. gcacatgtgc cgcctggaca tcgactccga gccaaccatt. 900. . 
gccagaaaca ccggcatcat ctgcaccatc ggcccagcct cccgctctgt ggacaagctg 960. . 
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aaggaaatga ttaaatctgg aatgaatgtt gcccgcctca. acttctcgca cggcacccac 1020 

gagtatcatg agggcacaat taagaacgtg cgagaggcca cagagagctt tgcctctgac 1080 

ccgatcacct acagacctgt ggctattgca ctggacacca agggacctga aatccgaact 1140 

ggactcatca agggaagtgg cacagcagag gtggagctca agaagggcgc agctctcaaa 1200 

gtgacgctgg acaatgcctt catggagaac tgcgatgaga atgtgctgtg ggtggactac 1260 

aagaacctca tcaaagttat agatgtgggc. agcaaaatct atgtggatga cggtctcatt 1320 

tccttgctgg ttaaggagaa aggcaaggac tttgtcatga ctgaggttga gaacggtggc 1380 

atgcttggta gtaagaaggg. agtgaacctc. ccaggtgctg. cggtcgacct gcctgcagtc 1440 

tcagagaagg acattcagga cctgaaattt ggcgtggagc agaatgtgga catggtgttc 1500 

gcttccttca tccgcaaagc tgctgatgtc catgctgtca ggaaggtgct aggggaaaag 1560 

ggaaagcaca. tcaagattat cagcaagatt gagaatcacg agggtgtgcg caggtttgat 1620 

gagatcatgg aggccagcga tggcattatg. gtggcccgtg gtgacctggg tattgagatc 1680 

cctgctgaaa aagtcttcct cgcacagaag. atgatgattg ggcgctgcaa cagggctggc 1740 

aaacccatca. tttgtgccac tcagatgttg gaaagcatga tcaagaaacc tcgcccgacc 1800 

cgcgctgagg. gcagtgatgt tgccaatgca gttctggatg gagcagactg. catcatgctg 1860 

tctggggaga. ccgccaaggg agactaccca. ctggaggctg tgcgcatgca gcacgctatt 1920 

gctcgtgagg ctgaggccgc aatgttccat. cgtcagcagt ttgaagaaat cttacgccac 1980 

agtgtacacc. acagggagcc tgctgatgcc atggcagcag gcgcggtgga. ggcctccttt 2040 

aagtgcttag cagcagctct gatagttatg accgagtctg. gcaggtctgc acacctggtg 2100. 

tcccggtacc gcccgcgggc tcccatcatc gccgtcaccc. gcaatgacca aacagcacgc 2160. 

caggcacacc tgtaccgcgg cgtcttcccc. gtgctgtgca agcagccggc ccacgatgcc 2220 

tgggcagagg atgtggatct ccgtgtgaac ctgggcatga atgtcggcaa agcccgtgga 2280. 

ttcttcaaga ccggggacct ggtgatcgtg. ctgacgggct. ggcgccccgg ctccggctac 234 0 

accaacacca tgcgggtggt. gcccgtgcca gcggccgcac tcgagcacca ccaccaccac 2400. 

cactga 2406 

<210> 2 . . 

<211> 801 

<212>. PRT. 

<213>. Artificial. Sequence . ... 

<220>. ... 

<223>. Description of Artificial. Sequence:. . 

TAT-hAPOBEOCMPK 

<400>. 2. 

Met Ala Ser Met Thr Gly Gly Gin Gin Met Gly Arg Asp Pro Gly Tyr 

1. 5 . 10 15 . . 

Gly. Arg Lys. Lys Arg Arg. Gin Arg Arg Arg Gly. Ser Arg. Tyr Pro Tyr . . . . 

20 25 30 

Asp Val. Pro. Asp Tyr Ala Asp He Thr Ser Glu Lys. Gly Pro Ser Thr 

. . 35 40 45 



Gly Asp Pro. Thr Leu Arg Arg Arg. He Glu Pro Trp Glu Phe Asp Val. 
50 55 60. . . 
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Phe. Tyr Asp Pro Arg Glu Leu Arg Lys Glu Ala. Cys Leu Leu Tyr Glu 

65 70 ... .75 80 

lie Lys Trp Gly Met Ser Arg Lys lie Trp Arg. Ser Ser Gly. Lys. Asn . 
. . . .85 90 95. 



. Thr Thr Asn His Val Glu Val Asn Phe lie Lys Lys Phe Thr Ser Glu 

100 105 .... 110 

Arg Asp. Phe His. Pro Ser lie Ser Cys. Ser. lie Thr. Trp Phe Leu Ser. . . 

115 .... 120 125 . . . 

. Trp Ser Pro Cys. Trp Glu Cys Ser. Gin Ala. lie Arg Glu Phe Leu Ser . . 
130 135 140 

Arg His Pro. Gly Val Thr Leu Val. lie Tyr Val Ala Arg Leu Phe Trp. . 
145 150. 155 .... 160. 

. His Met Asp Gin Gin Asn Arg Gin Gly Leu Arg. Asp Leu Val Asn Ser. . . 
165 170 175 

. Gly Val. Thr lie Gin lie. Met Arg Ala Ser Glu Tyr. Tyr. His. Cys Trp 
180 185 190. 

Arg Asn Phe Val Asn Tyr. Pro Pro Gly. Asp Glu Ala His Trp. Pro. Gin . . 

195 200 .205 

Tyr Pro. Pro Leu Trp Met Met Leu Tyr. Ala Leu Glu Leu His. Cys. lie 
. 210. ... ■ 215 . . 220 



. . lie Leu Ser Leu Pro Pro. Cys. Leu 
225 230 

. . His Leu Thr Phe Phe Arg Leu His. 
245 

lie. Pro Pro His lie Leu Leu Ala. 
.260 



Lys lie Ser Arg Arg Trp. Gin Asn 
235 240. . 

Leu Gin Asn Cys. His. Tyr. Gin Thr 
250 255 

Thr. Gly Leu He His Pro Ser. Val. 
265 270 



. . . . Ala Trp. Arg Glu Phe His Ala Ala Met Ala Asp Thr Phe. Leu Glu His 
275 280. . ... 285 

. . . . Met Cys Arg Leu Asp. He Asp Ser Glu Pro. Thr He Ala Arg Asn Thr 

290 . 295 .... 300. . 



Gly He He Cys. Thr He Gly Pro. Ala Ser. Arg. Ser. Val Asp Lys Leu 

.... 305 .... 310 315 .... . . 320. 
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Lys Glu Met lie Lys Ser Gly Met Asn Val Ala Arg. Leu Asn Phe. Ser 

325 330 335 

His Gly Thr His. Glu Tyr His Glu Gly. Thr lie Lys Asn Val Arg Glu 
340. . ... 345 350 

Ala Thr Glu Ser. Phe Ala Ser. Asp. Pro lie Thr Tyr Arg Pro Val Ala 
355 . . 360 365. . 

lie Ala Leu Asp. Thr Lys Gly Pro. Glu lie Arg. Thr Gly. Leu lie Lys 
. . 370. 375 380. ... 

Gly Ser Gly. Thr. Ala Glu Val. Glu Leu Lys Lys Gly. Ala Ala Leu Lys . . 
385 390 395 .400 

. Val Thr Leu Asp Asn Ala Phe Met Glu Asn Cys. Asp Glu Asn Val Leu . . . 

405 410 415 

Trp Val. Asp Tyr. Lys Asn Leu lie Lys. Val lie Asp Val. Gly. Ser. Lys 
420 . 425. 430. . 

lie Tyr Val Asp Asp Gly. Leu lie Ser Leu Leu Val Lys Glu Lys. Gly 
. 435 . 440 445 . 

. . . Lys Asp Phe Val. Met Thr Glu Val. Glu Asn Gly Gly Met Leu Gly. Ser 
... 450 455. 460 

. . Lys Lys Gly. Val. Asn Leu Pro Gly Ala Ala Val Asp Leu Pro Ala Val. . 
465 470 475 . . 480. . . 

Ser Glu Lys Asp. lie Gin Asp. Leu Lys Phe. Gly. Val. Glu Gin Asn Val . . 

485 490 ... 495. 

. . Asp. Met Val. Phe Ala Ser Phe lie. Arg Lys Ala Ala Asp Val His. Ala 
.500 ... 505 .... 510 

Val. Arg Lys Val. Leu Gly Glu Lys. Gly Lys His lie Lys. He He Ser 
. . 515 520 525 

Lys. He. Glu Asn His. Glu Gly Val Arg Arg. Phe. Asp. Glu He Met Glu 
. 530. . . ... 535 .... 540 

Ala Ser. Asp Gly He. Met Val Ala Arg. Gly. Asp. Leu Gly He Glu He 

. 545 ... . . 550 ... . 555 560 . 

Pro Ala. Glu Lys Val Phe Leu Ala Gin Lys. Met Met. He Gly. Arg. Cys 
565. . ... 570 575 
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Asn Arg Ala Gly Lys. Pro lie lie Cys Ala Thr Gin Met Leu Glu Ser. 
580 .... 585 . 590 

Met lie Lys Lys Pro Arg. Pro Thr Arg Ala Glu Gly Ser Asp Val Ala 

595 .... 600 605 

. Asn Ala Val Leu Asp Gly. Ala Asp Cys lie Met Leu Ser Gly Glu Thr 
. . . 610 615. . 620 

Ala Lys Gly. Asp Tyr. Pro. Leu Glu Ala Val Arg Met Gin His Ala lie 
625. . . . . 630 635. ... 640 

Ala. Arg Glu Ala. Glu Ala Ala Met Phe His Arg Gin Gin Phe Glu Glu 

.... 645 . 650 . ... 655 

lie Leu Arg. His Ser Val His His Arg. Glu Pro Ala Asp. Ala Met Ala 
660 665 ... 670. . 

Ala Gly Ala Val. Glu Ala Ser Phe Lys Cys Leu Ala Ala Ala Leu lie 
675 .680 ... 685 ... . 

Val. Met Thr Glu Ser. Gly Arg Ser. Ala His Leu Val Ser Arg Tyr. Arg . 
690. . 695 700. ... 

. Pro Arg Ala Pro. lie lie. Ala Val Thr. Arg. Asn Asp Gin Thr. Ala Arg 
. 705 710. . . 715 720 . 

Gin Ala His. Leu Tyr. Arg. Gly Val Phe. Pro. Val Leu Cys. Lys Gin Pro. 

725 . 730 735 . . 

Ala His. Asp Ala Trp. Ala Glu Asp Val. Asp Leu Arg Val. Asn Leu Gly . 
.740 745 750 

. Met Asn Val. Gly Lys. Ala. Arg Gly Phe. Phe. Lys Thr Gly. Asp Leu Val 
. . . 755 760 765 

. lie Val Leu Thr Gly. Trp Arg. Pro Gly. Ser Gly Tyr Thr Asn Thr Met 

. . . 770 . . . 775 ... . 780. . . 

Arg Val. Val Pro Val Pro Ala Ala Ala Leu Glu His. His His His His. . 
785 .. . 790 795 800. . 

. His. 
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<211> 2385 

<212> DNA .... 
<213> Artificial Sequence 

<220> .... 
<223> Description of Artificial Sequence: 

TAT-rAPOBEC-CMPK .... ... 

. . <400> 3 

. . atggctagca tgactggtgg acagcaaatg ggtcgggatc cgggatatgg aagaaaaaaa 60 . 

. agaagacaaa gaagaagagg ctctagatac. ccctacgacg. tgcccgacta cgccgatatc 120 

agttccgaga caggccctgt agctgttgat. cccactctga ggagaagaat tgagccccac 180. 

. gagtttgaag tcttctttga cccccgggaa. cttcggaaag agacctgtct gctgtatgag. 240 . 

. . atcaactggg gaggaaggca cagcatctgg. cgacacacga gccaaaacac caacaaacac 300 

. . gttgaagtca atttcataga aaaatttact. acagaaagat. acttttgtcc aaacaccaga 360 

tgctccatta cctggttcct gtcctggagt ccctgtgggg. agtgctccag ggccattaca 420 

gaatttttga. gccgataccc ccatgtaact ctgtttattt. atatagcacg gctttatcac. 480 

cacgcagatc ctcgaaatcg gcaaggactc agggacctta. ttagcagcgg tgttactatc. 540 

cagatcatga cggagcaaga gtctggctac. tgctggagga attttgtcaa. ctactcccct 600. . . 

tcgaatgaag ctcattggcc aaggtacccc. catctgtggg tgaggctgta cgtactggaa 660. . 

ctctactgca tcattttagg acttccaccc. tgtttaaata ttttaagaag aaaacaacct 720 

caactcacgt ttttcacgat tgctcttcaa agctgccatt accaaaggct accaccccac 780 . . 

atcctgtggg ccacagggtt gaaagaattc cacgctgcca tggcagacac ctttctggag 840. 

cacatgtgcc gcctggacat cgactccgag ccaaccattg ccagaaacac cggcatcatc 900 

tgcaccatcg gcccagcctc ccgctctgtg gacaagctga aggaaatgat taaatctgga 960 

atgaatgttg. cccgcctcaa cttctcgcac ggcacccacg. agtatcatga gggcacaatt 1020 

aagaacgtgc gagaggccac agagagcttt gcctctgacc cgatcaccta cagacctgtg. 1080 

. . gctattgcac tggacaccaa gggacctgaa atccgaactg. gactcatcaa gggaagtggc. 1140. 

acagcagagg tggagctcaa gaagggcgca gctctcaaag. tgacgctgga caatgccttc 1200 

atggagaact gcgatgagaa tgtgctgtgg gtggactaca agaacctcat caaagttata 1260 

. . gatgtgggca gcaaaatcta tgtggatgac. ggtctcattt ccttgctggt taaggagaaa 1320 

ggcaaggact ttgtcatgac tgaggttgag aacggtggca tgcttggtag. taagaaggga 1380 

gtgaacctcc. caggtgctgc ggtcgacctg cctgcagtct. cagagaagga cattcaggac 1440 

ctgaaatttg gcgtggagca gaatgtggac atggtgttcg. cttccttcat ccgcaaagct 1500 

gctgatgtcc atgctgtcag gaaggtgcta ggggaaaagg. gaaagcacat caagattatc. 1560 

agcaagattg agaatcacga gggtgtgcgc. aggtttgatg. agatcatgga ggccagcgat 1620 

ggcattatgg tggcccgtgg tgacctgggt attgagatcc ctgctgaaaa agtcttcctc 1680 

. . . gcacagaaga tgatgattgg gcgctgcaac agggctggca aacccatcat ttgtgccact. 1740 

cagatgttgg aaagcatgat. caagaaacct. cgcccgaccc gcgctgaggg. cagtgatgtt 1800. 

. . gccaatgcag ttctggatgg agcagactgc atcatgctgt ctggggagac. cgccaaggga 1860. . 

gactacccac tggaggctgt gcgcatgcag cacgctattg ctcgtgaggc tgaggccgca 1920 

. atgttccatc gtcagcagtt tgaagaaatc ttacgccaca gtgtacacca cagggagcct 1980 . 

gctgatgcca tggcagcagg cgcggtggag gcctccttta agtgcttagc agcagctctg 2040 . 

atagttatga ccgagtctgg caggtctgca cacctggtgt cccggtaccg. cccgcgggct 2100 

cccatcatcg ccgtcacccg caatgaccaa acagcacgcc aggcacacct. gtaccgcggc 2160. 

. . gtcttccccg tgctgtgcaa gcagccggcc cacgatgcct. gggcagagga tgtggatctc 2220 . 

. . . cgtgtgaacc tgggcatgaa tgtcggcaaa gcccgtggat tcttcaagac cggggacctg 2280. . 

gtgatcgtgc tgacgggctg gcgccccggc tccggctaca ccaacaccat gcgggtggtg 2340 . 
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cccgtgccag cggccgcact cgagcaccac caccaccacc actga 2385 



<210> 4 

<211> 794 ... 
<212>. PRT 

<213> Artificial Sequence 



<220> .... .... 

<223>. Description of Artificial Sequence: . 

. TAT- rAPOBEC-CMPK 

<400> 4. ... 

Met. Ala Ser. Met Thr Gly Gly Gin Gin Met Gly Arg Asp Pro Gly Tyr 

.1 ... 5 . ' . . . . 10 ... 15 . 

Gly. Arg Lys Lys Arg Arg Gin Arg. Arg Arg Gly Ser. Arg. Tyr Pro Tyr 

... 20 25 ... 30 

Asp Val Pro Asp Tyr. Ala Asp lie Ser Ser Glu Thr Gly Pro Val Ala 

.35 . 40 . 45 . 

Val Asp. Pro. Thr Leu Arg Arg Arg lie Glu Pro. His. Glu Phe Glu Val 

50. . . 55 . . 60. 

Phe Phe Asp Pro Arg Glu Leu Arg Lys. Glu Thr Cys. Leu Leu Tyr Glu 

65 .70. 75 80. 

lie. Asn Trp. Gly Gly. Arg His Ser lie. Trp Arg His. Thr. Ser Gin Asn 

85 90 95 ... 



Thr Asn Lys His Val. Glu Val Asn Phe lie Glu Lys. Phe Thr Thr Glu 

... 100 105 110 

Arg Tyr Phe Cys Pro Asn Thr Arg Cys. Ser lie Thr. Trp. Phe Leu Ser. 

.115 120 125. . . 

Trp. Ser Pro Cys Gly Glu Cys Ser Arg. Ala lie Thr. Glu Phe Leu Ser 

. .130 135. . . 140 

Arg Tyr Pro His Val. Thr Leu Phe lie Tyr lie. Ala. Arg Leu Tyr. His 

145 ... 150 . .155 160 

His Ala Asp Pro. Arg. Asn Arg Gin Gly. Leu Arg Asp. Leu lie. Ser. Ser 

. . . 165 170 175 . 

Gly Val Thr He Gin He Met. Thr Glu Gin Glu Ser. Gly. Tyr Cys. Trp 

. . . . 180 185 190 , 
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Arg Asn Phe Val Asn Tyr Ser Pro Ser Asn Glu Ala His Trp Pro. Arg. 

195 . 200 205. . . . 

Tyr Pro His Leu Trp Val Arg Leu Tyr Val Leu Glu Leu Tyr Cys He 

210. . 215 . . 220 

He Leu Gly Leu Pro Pro Cys Leu Asn He Leu Arg Arg Lys. Gin Pro 
.225 .... 230. ... 235 240 

Gin Leu Thr Phe Phe Thr. He Ala Leu Gin Ser Cys. His Tyr Gin Arg 

. . 245 .... 250 . 255. . . 

Leu Pro. Pro. His. He Leu Trp Ala Thr. Gly Leu Lys Glu Phe. His Ala . . 

260. 265 .... 270 

. Ala Met. Ala Asp. Thr Phe. Leu Glu His Met Cys Arg Leu Asp. He Asp .... 

. ... 275 280 . . 285 

. Ser Glu Pro Thr. lie. Ala Arg. Asn Thr. Gly He He. Cys. Thr He. Gly 

290. ... ... 295. . . 300. ... . . . 

Pro Ala Ser. Arg Ser. Val Asp Lys Leu Lys Glu Met He Lys Ser Gly 

305 .... . 310 .... 315 320 . 

. Met Asn Val Ala Arg Leu Asn Phe Ser His Gly. Thr. His Glu Tyr His 

.... .... 325 330 335 . . . 

Glu Gly Thr. He Lys Asn Val Arg Glu Ala Thr Glu Ser. Phe. Ala. Ser 
340 345. 350 

. Asp Pro He Thr. Tyr. Arg Pro Val Ala He Ala Leu Asp. Thr. Lys. Gly. 

.... 355 . . .... 360 . .... 365. 

. Pro. Glu He. Arg Thr Gly. Leu He Lys Gly Ser. Gly. Thr Ala. Glu Val 

. 370 .375 . . 380. ... 

Glu Leu Lys Lys Gly Ala. Ala Leu Lys Val. Thr Leu Asp Asn Ala Phe . . . 

385 390 . . 395 400 

Met Glu Asn Cys Asp. Glu Asn Val Leu Trp Val Asp Tyr. Lys. Asn Leu 
405. 410 415 

He Lys. Val. He Asp Val. Gly. Ser Lys He Tyr Val Asp. Asp Gly Leu 

420. . 425. ... .... 430. 



He Ser. Leu Leu Val Lys. Glu Lys Gly Lys. Asp. Phe Val Met Thr Glu 
. . 435. 440 . 445 . 
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Val Glu Asn Gly Gly Met. Leu Gly Ser Lys Lys Gly Val Asn Leu Pro 
450 455 .460 

Gly Ala Ala Val Asp Leu Pro. Ala Val Ser Glu Lys. Asp lie Gin Asp 

465 . . 470 . . . . 475 . 480 

Leu Lys Phe Gly Val Glu Gin Asn Val Asp Met Val Phe Ala Ser. Phe 

485 490 495 

He Arg Lys Ala Ala Asp Val His Ala Val Arg Lys Val Leu Gly. Glu 
500 505 510 

Lys Gly. Lys His He Lys He. lie Ser Lys He Glu Asn His Glu Gly 

. 515 520 .525 



Val Arg Arg Phe Asp Glu He Met Glu Ala Ser Asp Gly He. Met Val 

530 . . 535 540 

Ala Arg Gly. Asp Leu Gly He Glu He Pro Ala Glu Lys Val Phe Leu 

545 . 550 . . 555 .560 

Ala. Gin Lys Met Met He Gly. Arg Cys Asn Arg. Ala Gly Lys Pro He 

565 570 .... 575 

He Cys Ala Thr Gin Met Leu Glu Ser. Met lie Lys Lys Pro. Arg Pro 
580 585. . 590 ... 

Thr Arg Ala Glu Gly Ser. Asp Val. Ala Asn Ala Val Leu Asp. Gly. Ala . 
... 595 600. . . ... 605 . . 

Asp Cys lie Met Leu Ser. Gly Glu Thr Ala Lys Gly Asp. Tyr Pro Leu . 
610 ... 615 ... . 620 

Glu Ala Val Arg. Met Gin His Ala He. Ala Arg Glu Ala Glu Ala Ala 

625 ... 630 635 640. . 

Met Phe His Arg. Gin Gin Phe Glu Glu lie Leu Arg. His Ser Val His 
. . . 645 650. . . 655. . . 

His Arg Glu Pro Ala Asp Ala Met Ala Ala Gly. Ala Val. Glu Ala Ser. . 

.660 665 .670 

Phe Lys Cys Leu Ala Ala Ala. Leu He. Val Met Thr Glu Ser Gly Arg 
675 ... 680. . . 685. 

Ser. Ala His Leu Val. Ser Arg. Tyr Arg. Pro. Arg. Ala. Pro. lie lie Ala 
. . . 690 695 700. .... 
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Val Thr Arg Asn Asp Gin Thr Ala Arg Gin Ala His Leu Tyr Arg Gly 

705 710 . 715 720 

Val Phe Pro Val Leu Cys Lys. Gin Pro Ala His Asp Ala Trp Ala Glu 

725 .... 730 . 735 

Asp Val. Asp Leu Arg Val Asn Leu Gly Met Asn Val Gly Lys. Ala Arg 

740 745 . 750. . 

Gly Phe Phe Lys Thr Gly Asp Leu Val lie Val. Leu Thr Gly Trp Arg. 

755 . . 760 . 765 . . 

Pro Gly. Ser Gly Tyr Thr Asn Thr. Met Arg Val. Val Pro Val Pro Ala 

770 . 775. ... 780 

Ala. Ala Leu Glu His. His His His. His. His. .... 
785 . 790 



<210> 5 . . . . 

<211> 1914 . 
<212> DNA . . 

<213> Artificial Sequence 



<220> . . . 

<223>. Description of Artificial Sequence: TAT-hACF . . . . 

<400>. 5 ... 

atggctagca tgactggtgg acagcaaatg ggtcgggatc cgggatatgg aagaaaaaaa 60 

agaagacaaa gaagaagagg ctctagatac ccctacgacg tgcccgacta cgccgatatc 120 

atggaatcaa atcacaaatc cggggatgga ttgagcggca ctcagaagga agcagccctc 180. 

cgcgcactgg tccagcgcac aggatatagc. ttggtccagg aaaatggaca aagaaaatat 240 . 

ggtggccctc cacctggttg. ggatgctgca ccccctgaaa ggggctgtga aatttttatt. 300 

ggaaaacttc cccgagacct ttttgaggat gagcttatac cattatgtga. aaaaatcggt 360 

aaaatttatg aaatgagaat gatgatggat tttaatggca acaatagagg atatgcattt. 420 

gtaacatttt caaataaagt ggaagccaag aatgcaatca agcaacttaa taattatgaa 480 

attagaaatg ggcgcctctt aggggtttgt gccagtgtgg acaactgccg attatttgtt. 540 

gggggcatcc caaaaaccaa aaagagagaa gaaatcttat cggagatgaa aaaggttact 600 

gaaggtgttg tcgatgtcat. cgtctaccca agcgctgcag ataaaaccaa aaaccgaggc 660. . 

tttgccttcg tggagtatga. gagtcatcga acagctgcca tggcgaggag gaaactgcta 720. . 

ccaggaagaa ttcagttatg. gggacatggt attgcagtag actgggcaga gccagaagta 780. . 

gaagttgatg aagatacaat gtcttcagtg aaaatcctat atgtaagaaa tcttatgctg 840. 

tctacctctg aagagatgat tgaaaaggaa ttcaacaata tcaaaccagg tgctgtggag 900. . 

agggtgaaga aaattcgaga ctatgctttt gtgcacttca gtaaccgaaa agatgcagtt 960 

gaggctatga aagctttaaa tggcaaggtg ctggatggtt cccccattga agtcacccta 1020. 

gcaaaaccag tggacaagga cagttatgtt aggtataccc gaggcacagg tggaaggggc 1080. 

accatgctgc aaggagagta tacctactct ttgggccaag tttatgatcc caccacaacc 1140 
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taccttggag ctcctgtctt ctatgccccc cagacctatg. cagcaattcc cagtcttcat 1200 

ttcccagcca ccaaaggaca tctcagcaac agagccatta. tccgagcccc ttctgttaga 1260 

ggggctgcgg gagtgagagg actgggcggc cgtggctatt. tggcatacac aggcctgggt 1320 

. cgaggatacc aggtcaaagg agacaaaaga. gaagacaaac tctatgacat tttacctggg 1380 

. atggagctca ccccaatgaa tcctgtcaca ttaaaacccc aaggaattaa actcgctccc 1440 

. cagatattag aagagatttg tcagaaaaat aactggggac agccagtgta ccagctgcac 1500 

tctgctattg gacaagacca aagacagcta ttcttgtaca aaataactat tcctgctcta 1560 

. gccagccaga atcctgcaat. ccaccctttc acacctccaa agctgagtgc ctttgtggat 1620 

. gaagcaaaga cgtatgcagc cgaatacacc ctgcagaccc tgggcatccc cactgatgga 1680 

. ggcgatggca. ccatggctac tgctgctgct gctgctactg ctttcccagg atatgctgtc 1740 

. cctaatgcaa ctgcacccgt gtctgcagcc cagctcaagc aagcggtaac ccttggacaa 1800 

gacttagcag catatacaac ctatgaggtc tacccaactt ttgcagtgac tgcccgaggg. 1860 

. gatggatatg gcaccttcgc. ggccgcactc gagcaccacc accaccacca ctga . . 1914 



. <210> 6 

. <211> 637 

. <212> PRT 

. <213>. Artificial Sequence 

. <220> . 

<223>. Description of Artificial Sequence: TAT-hACF 

<400> 6 .... ... 

Met Ala Ser Met Thr Gly Gly Gin Gin Met Gly Arg Asp. Pro. Gly Tyr 

1 5 10. . , 15. . . 



Gly Arg Lys Lys Arg Arg Gin Arg. 
20 

Asp Val Pro Asp. Tyr Ala Asp. He 
35 . . 40 



Arg. Arg Gly Ser Arg. Tyr. Pro. Tyr 
25 30 

Met Glu Ser. Asn His Lys Ser Gly 

45 



. Asp Gly Leu Ser Gly. Thr Gin Lys. Glu Ala Ala. Leu Arg Ala Leu Val. 

.50 55 60. 

Gin Arg Thr Gly Tyr Ser Leu Val Gin Glu Asn Gly. Gin Arg Lys. Tyr 

.65. 70. 75 .... 80 

Gly. Gly. Pro Pro. Pro Gly Trp. Asp Ala Ala Pro. Pro Glu Arg. Gly. Cys. 

85 ... .90 .... 95 . . 

Glu He Phe He Gly Lys Leu Pro Arg. Asp Leu Phe Glu Asp Glu Leu 

100 ... 105 . . 110 . . . 

. He Pro. Leu Cys. Glu Lys He. Gly. Lys lie Tyr. Glu Met Arg Met Met 

115. 120 . 125 . 
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Met Asp. Phe Asn Gly Asn Asn Arg Gly Tyr Ala Phe Val. Thr Phe Ser 
130 135 . 140. ... 

Asn Lys Val Glu Ala Lys Asn Ala He Lys Gin Leu Asn Asn Tyr Glu 
145. . 150 . 155 160 

He Arg Asn Gly Arg Leu Leu Gly Val Cys Ala Ser. Val Asp Asn Cys 
. . . . 165. ... 170 . 175 . . 

Arg. Leu Phe Val Gly Gly He Pro Lys Thr Lys Lys. Arg Glu Glu He 
.... .180 185 . . 190 

Leu Ser Glu Met Lys Lys Val Thr Glu Gly Val Val. Asp Val. He Val 
... 195 . . ... 200 . . 205 

Tyr. Pro Ser Ala Ala Asp Lys. Thr Lys Asn Arg. Gly. Phe Ala. Phe Val. . 
. . 210 ... .• 215 220. 

Glu Tyr. Glu Ser. His Arg Thr Ala Ala Met Ala Arg. Arg Lys. Leu Leu 
225 230 235 ... 240 

Pro Gly Arg He. Gin Leu Trp Gly His Gly. He Ala Val Asp Trp. Ala 
245 250 255. . 

Glu Pro Glu Val Glu Val. Asp Glu Asp Thr Met Ser Ser. Val Lys. He 
260 265 270. 

Leu Tyr Val. Arg. Asn Leu Met Leu Ser Thr Ser. Glu Glu Met. He Glu . 

. . . 275 280 ..... 285 

Lys Glu Phe Asn Asn He Lys Pro Gly Ala Val. Glu Arg. Val. Lys Lys 
. . . 290 . . 295 300 

lie Arg Asp. Tyr Ala Phe Val. His Phe Ser Asn Arg Lys Asp. Ala Val. 
305 ... 310 . 315 320. . 

Glu Ala Met. Lys Ala Leu Asn Gly Lys. Val Leu Asp Gly Ser. Pro He, . . . 
325 . ... .330 335 .. . 

Glu Val. Thr Leu Ala Lys. Pro Val Asp Lys Asp. Ser Tyr. Val Arg Tyr 
... 340. . . . 345 350 ... 

Thr Arg Gly. Thr Gly Gly Arg. Gly Thr. Met Leu Gin Gly. Glu Tyr. Thr 
. 355 360 . . 365. 

Tyr. Ser Leu Gly Gin Val. Tyr Asp. Pro. Thr Thr Thr. Tyr Leu Gly. Ala . . 
. ... 370 . 375 . . . . 380 ... 
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Pro Val Phe Tyr Ala Pro Gin Thr Tyr Ala Ala He Pro Ser Leu His. . 
385 . . 390 . 395 400 

Phe Pro Ala Thr. Lys. Gly His Leu Ser Asn Arg Ala He He Arg Ala 

. 405 . . 410 . . 415 . . . 

Pro Ser Val Arg. Gly Ala Ala Gly Val Arg. Gly Leu Gly Gly Arg Gly. . 

420 ... 425 . . 430. .... 

Tyr. Leu Ala Tyr Thr. Gly Leu Gly Arg Gly Tyr Gin Val Lys Gly Asp 
435 440 . . . 445. . 

Lys Arg Glu Asp. Lys. Leu Tyr. Asp He Leu Pro Gly Met Glu Leu Thr. . 

450 455. . . 460 

Pro Met Asn Pro Val Thr Leu Lys Pro. Gin Gly. He Lys Leu Ala Pro 
465. ... 470 . . 475. . 480 . 

Gin He Leu Glu Glu lie. Cys Gin Lys Asn Asn Trp. Gly Gin Pro. Val 
485 490. . . 495. 

Tyr Gin Leu His. Ser. Ala. lie Gly Gin Asp. Gin Arg. Gin Leu Phe Leu . . . 

500. . . . . 505 510 

Tyr Lys lie Thr. lie Pro. Ala Leu Ala Ser. Gin Asn Pro Ala lie His . . . 

515 .... . 520 .... 525 

Pro Phe Thr Pro Pro Lys Leu Ser Ala. Phe Val. Asp Glu Ala Lys Thr 

530 535 540 

Tyr Ala Ala Glu Tyr Thr Leu Gin Thr Leu Gly lie Pro. Thr Asp. Gly 
545 .... 550 555 560 . . 

Gly Asp Gly Thr. Met Ala Thr Ala Ala. Ala Ala. Ala Thr. Ala Phe Pro . 

565 .... .... 570 ... 575 

Gly Tyr Ala Val Pro. Asn Ala Thr Ala. Pro Val Ser Ala Ala Gin Leu 
. . 580 585. . . 590 . . . 

Lys Gin Ala Val. Thr. Leu Gly. Gin Asp Leu Ala Ala. Tyr Thr Thr. Tyr. . . 

595 .. . ... 600. 605 

Glu Val. Tyr Pro Thr. Phe. Ala Val Thr Ala. Arg. Gly. Asp Gly Tyr Gly . . . 
610 615 620 

Thr Phe Ala Ala Ala. Leu Glu His His His His. His His ... 

625. 630 . 635 
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<210> 7 
<211> 1914 

<212> DNA ... 
<213> Artificial Sequence .... ... 

<220> 

<223> Description of Artificial. Sequence: TAT-rACF . 

<400> 7 

atggctagca tgactggtgg acagcaaatg ggtcgggatc cgggatatgg aagaaaaaaa 60 

agaagacaaa gaagaagagg ctctagatac ccctacgacg tgcccgacta cgccgatatc 120 

atggaatcaa atcacaaatc cggggatgga ttgagcggca cccagaagga agcagcactc 180 

cgcgcactgg tccagcgcac aggatatagc ttggtccagg aaaatggaca. aagaaaatat 240 

ggtggtcctc caccaggctg ggatactaca cccccagaaa ggggctgcga gattttcatt 300 

gggaaacttc cccgggacct ttttgaggat gaactcatac cattgtgtga aaaaattggt 360 . . 

aaaatttatg aaatgagaat gatgatggat ttcaatggga acaacagagg ctatgcattt 420 

gtaaccttct caaataagca ggaagccaag aatgcaatca agcaacttaa taattatgaa 480 . . 

attcggaatg gccgtctcct. gggcgtctgt gccagtgtgg. acaactgccg gttgtttgtg. 540 

gggggaatcc ccaaaaccaa. aaagagagaa gaaatcttgt cagagatgaa aaaggtcact. 600 . 

gaaggagttg ttgatgtcat. tgtctaccca agcgctgccg ataaaaccaa. aaaccggggg 660. 

tttgcctttg tggaatatga gagtcaccgc gcagccgcca tggctaggcg gaggctgctg. 720. . 

ccaggaagaa ttcagttgtg. gggacatcct. atcgcagtag actgggcaga gccagaagtc 780 . 

gaagttgacg aagacacaat gtcttccgtg. aaaatcctgt. acgtaaggaa ccttatgctg 840 

tctacctcgg aagagatgat. tgagaaggaa. ttcaacagta ttaaaccagg tgctgtggaa 900 

cgggtgaaga agatccgaga ctatgctttt gtgcatttca gtaaccgaga agatgcagtt 960 

gaagccatga aggctttgaa tggcaaggtg. ctggatggtt ccccaataga agtgaccttg 1020. 

gccaagccag tggacaagga cagttacgtt aggtacaccc ggggcaccgg gggcaggaac 1080 

accatgctgc aagaatacac ctaccctctg. agccatgttt atgaccctac cacaacctac 1140. . 

cttggagctc ctgtcttcta. tactccccaa gcctacgcag ccattccaag tcttcatttc 1200. . 

ccagctacca aaggacatct. cagcaacaga gctctcatcc ggaccccttc tgtcagaggg. 1260. 

gctgcgggcg tgagaggact gggcggccgt. gggtatttgg catatacagg cctgggtcga 1320 

ggataccagg tcaaaggaga caagagacaa gacaaactct atgaccttct gcctgggatg 1380. 

gagctcaccc cgatgaatac tatctcttta aaaccacaag gagttaaact tgctcctcag 1440. . 

atattagaag aaatctgtca gaaaaataac tggggacagc cagtgtacca gctgcactct 1500. . 

gccattggac aagaccaaag acagttattc ctatacaaag taactatccc agcgctggcc 1560. . 

agccagaatc ctgcgatcca ccctttcaca cccccaaagc. taagcgccta cgtggatgaa. 1620. . 

gcaaagaggt acgccgcaga gcacacccta cagacactag gcatccccac agaaggaggg. 1680 . 

gacgctggga ctacagcacc. cactgccaca tccgccactg tgtttccagg. atacgctgtc. 1740 . 

cccagtgcca ccgctcctgt. gtctacagcc. cagctcaagc. aagcagtgac acttggacaa. 1800 

gacttagcag catatacaac. ctatgaggtc taccctactt ttgcagtgac cacccgaggt 1860. . 

gatggatatg gcaccttcgc. ggccgcactc gagcaccacc accaccacca ctga . 1914. 

<210> 8 ... 

<211> 637 ... 

<212> PRT ... 

<213> Artificial Sequence 
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<220> ... 

<223> Description of Artificial Sequence: TAT-rACF 

<400> 8 ... ... .... 

Met Ala Ser Met. Thr Gly. Gly Gin Gin Met Gly. Arg Asp Pro Gly Tyr 
.1. 5 10 15 . 

Gly Arg Lys Lys. Arg Arg. Gin Arg. Arg Arg Gly Ser Arg Tyr Pro Tyr 

.20. ... 25 ... . 30 

Asp. Val Pro Asp Tyr Ala Asp. lie Met Glu Ser Asn His Lys Ser Gly 
35 40 . 45 

Asp Gly Leu Ser. Gly Thr. Gin Lys Glu Ala Ala Leu Arg Ala Leu Val 

50 ... 55 . .... 60 

Gin Arg Thr Gly Tyr Ser. Leu Val Gin Glu Asn Gly Gin Arg. Lys Tyr 
.65. 70 75 . . . . 80 

Gly Gly Pro Pro. Pro Gly Trp Asp. Thr Thr Pro Pro Glu Arg Gly. Cys 

. . 85 90 ..... . 95 . . . 

Glu lie Phe lie. Gly. Lys Leu Pro. Arg Asp Leu Phe Glu Asp Glu Leu 

100 . . . . . 105 110 ... 

lie Pro Leu Cys Glu Lys lie Gly Lys. lie Tyr. Glu Met Arg Met Met 
.115 120 125. . . 

Met Asp. Phe Asn Gly. Asn Asn Arg. Gly. Tyr. Ala Phe Val Thr Phe Ser 
. 130. . . 135 . . . 140 

Asn Lys Gin Glu Ala Lys. Asn Ala lie. Lys. Gin Leu Asn Asn Tyr Glu 
145 150 155 ... 160 

lie Arg Asn Gly Arg. Leu Leu Gly Val. Cys. Ala Ser Val. Asp. Asn Cys 
. . 165. . 170 175 . . 

Arg Leu Phe Val. Gly Gly. lie Pro. Lys. Thr. Lys Lys. Arg Glu Glu lie. 

180. . .... 185 190 .... 



Leu Ser Glu Met Lys Lys Val Thr. Glu Gly Val Val Asp Val He Val 
195 200 205 . . 

Tyr Pro Ser Ala. Ala Asp Lys Thr Lys Asn Arg Gly. Phe Ala Phe. Val 
. . 210 . 215 220 

Glu Tyr. Glu Ser His. Arg Ala Ala. Ala Met. Ala Arg. Arg Arg. Leu Leu 
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225 . 230 235 240 

Pro. Gly Arg lie Gin Leu Trp. Gly His Pro lie Ala Val. Asp Trp Ala 
245 250 255 . . 

Glu Pro Glu Val Glu Val Asp Glu Asp. Thr Met Ser Ser Val Lys lie 
260 . 265 270. 

Leu Tyr Val Arg Asn Leu Met Leu Ser. Thr Ser Glu Glu Met lie Glu 
275 280. ... 285 

Lys Glu. Phe Asn Ser He. Lys Pro Gly Ala Val. Glu Arg Val. Lys Lys 
. . . 290. . ... 295 . . 300 

He Arg Asp Tyr Ala Phe Val His Phe Ser Asn Arg Glu Asp Ala Val 
305 .310 315 ... . . 320 

Glu Ala Met. Lys Ala Leu Asn Gly. Lys Val Leu Asp Gly Ser Pro He 

325 . 330 . . .335 

Glu Val. Thr. Leu Ala Lys. Pro Val Asp. Lys Asp Ser Tyr Val Arg Tyr 
340 .345 350 .... 

Thr Arg Gly Thr Gly Gly Arg Asn Thr Met Leu Gin Glu Tyr Thr Tyr. 

. . . 355 . . 360 . . 365 

Pro. Leu Ser. His Val Tyr Asp Pro. Thr Thr. Thr Tyr Leu Gly. Ala Pro. 
. . 370 375 .... 380 

Val Phe Tyr. Thr. Pro. Gin Ala Tyr Ala Ala. He Pro Ser. Leu His. Phe. 
385 390 395 400. 

Pro. Ala Thr. Lys Gly. His. Leu Ser Asn Arg Ala Leu He Arg Thr. Pro. 
.... 405 410. 415 

Ser. Val Arg. Gly Ala. Ala Gly Val. Arg Gly. Leu Gly Gly Arg Gly Tyr. 
420 .425 430 

Leu Ala Tyr. Thr Gly Leu Gly Arg Gly. Tyr. Gin Val. Lys Gly. Asp. Lys 
... 435 ... 440 445 

Arg. Gin Asp Lys. Leu Tyr Asp. Leu Leu Pro. Gly Met Glu Leu Thr. Pro 
450. 455 460 

Met Asn Thr. lie Ser Leu Lys Pro Gin Gly Val Lys. Leu Ala Pro. Gin 
465 . 470 475 . . 480 

He Leu Glu Glu He Cys. Gin Lys Asn Asn Trp Gly. Gin Pro. Val. Tyr 
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485 

Gin Leu His Ser Ala 
500 

Lys Val Thr He Pro 
515 

Phe Thr Pro Pro. Lys 

530 . . 

Ala Ala Glu His. Thr. 
545 

Asp Ala Gly. Thr Thr 
565. 

Gly Tyr Ala Val Pro 

. . . 580 

Lys. Gin Ala Val. Thr 
595 . . 

Glu Val Tyr Pro Thr 
.610 

Thr Phe Ala Ala Ala. 
625 



490 . 

He Gly Gin Asp Gin Arg 
505 

Ala Leu Ala Ser Gin Asn 

. . 520 

Leu Ser Ala Tyr. Val Asp 

. . . 535 .... 

Leu Gin Thr Leu Gly He 

550. . . 555 

Ala Pro Thr Ala. Thr Ser 
570 

Ser Ala Thr. Ala Pro Val 
. . 585. . . 

Leu Gly Gin Asp Leu Ala 
600 

Phe Ala Val Thr Thr. Arg 
615 . . 

Leu Glu His. His His His. 
630 . 635 



495 

Gin Leu Phe Leu Tyr 

510 

Pro Ala He His. Pro. . . 
525 . .... 

Glu Ala Lys Arg Tyr . . 

540 

Pro Thr. Glu Gly Gly. . 

560. . . 

Ala. Thr Val Phe Pro 

575 

Ser Thr Ala Gin Leu 

. 590 . ... 

Ala Tyr Thr. Thr Tyr . . 
605 . 

Gly Asp Gly Tyr Gly 
620 

His. His 



<210>. 9 

<211>. 9 

<212> PRT 

<213> Artificial. Sequence 

<220> 

<223> Description of. Artificial. Sequence: protein 
transduction domain of HIV-1 ... 

<400> 9 

Arg Lys Lys Arg. Arg Gin Arg. Arg. Arg 

1. . 5 ... 



<210> 10 

<211>. 27 . . . 

<212>. DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence: encodes, 
protein transduction domain of HIV-1 

<400> 10 ... .... 

agaaaaaaaa gaagacaaag aagaaga. ... ... 27 

<210> 11 ... 

<211> 236 

<212> PRT ... 

<213> Homo sapiens. ... 

<400> 11 . . . 

Met Thr Ser. Glu Lys Gly Pro. Ser Thr Gly Asp Pro Thr Leu Arg Arg 

1 5 ... .10 15. . . . 

Arg lie Glu Pro Trp. Glu Phe Asp. Val Phe Tyr Asp. Pro Arg Glu Leu . . . . 

. . 20 . . 25. . 30 

Arg Lys Glu Ala Cys. Leu Leu Tyr Glu lie Lys Trp Gly Met Ser Arg 

35 40. . . . .45 

Lys lie Trp Arg Ser. Ser. Gly Lys. Asn Thr Thr Asn His. Val. Glu Val. 
50 55 60 

Asn Phe lie Lys Lys Phe Thr Ser Glu Arg Asp Phe His Pro Ser lie . . . 

65 .70 ... 75. 80. 

Ser Cys Ser lie Thr. Trp. Phe Leu Ser. Trp Ser. Pro Cys Trp. Glu Cys. 

85 90 .95 

Ser Gin Ala lie Arg Glu Phe Leu Ser Arg. His. Pro Gly Val Thr. Leu 
100 105 .... 110 

Val lie Tyr Val Ala Arg Leu Phe Trp His Met Asp Gin Gin Asn. Arg 
115 120 . ... 125 ... 

Gin Gly Leu Arg. Asp. Leu Val Asn Ser. Gly Val Thr lie. Gin lie Met. 
130 ... 135 140 .... 

Arg Ala Ser Glu Tyr. Tyr His Cys. Trp. Arg Asn Phe Val. Asn Tyr Pro 
145 .150 155 160 . 

Pro Gly Asp Glu Ala His Trp Pro Gin Tyr. Pro. Pro Leu Trp Met Met. 

165 170 175 
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Leu Tyr Ala Leu Glu Leu His Cys 
180 

Leu Lys lie Ser Arg Arg Trp. Gin 
195 200 

His Leu Gin Asn Cys His Tyr Gin 
210 215. . 

Ala Thr Gly Leu He His Pro Ser 
225 230 



He. He Leu Ser Leu Pro Pro Cys 
185 190 

Asn His Leu Thr Phe Phe Arg Leu 
205 

Thr He Pro. Pro His He Leu Leu 

. 220 

Val. Ala Trp Arg 

235. . . . 



<210> 12 .... 
<211> 711 

<212> DNA . . 

<213> Homo sapiens 

<400>. 12 

atgacttctg agaaaggtcc ttcaaccggt 
tgggagtttg acgtcttcta tgaccccaga 
gaaatcaagt ggggcatgag ccggaagatc 
cacgtggaag ttaattttat aaaaaaattt. 
agctgctcca tcacctggtt cttgtcctgg. 
agagagtttc tgagtcggca ccctggtgtg. 
tggcacatgg atcaacaaaa tcggcaaggt. 
attcagatta. tgagagcatc agagtattat 
cctggggatg aagctcactg. gccacaatac 
gagctgcact gcataattct aagtcttcca 
aatcatctta catttttcag. acttcatctt 
cacatccttt tagctacagg gctgatacat 



gaccccactc tgaggagaag aatcgaaccc 60 
gaacttcgta aagaggcctg tctgctctac 120 
tggcgaagct caggcaaaaa caccaccaat 180 
acgtcagaaa gagattttca cccatccatc 240 
agtccctgct gggaatgctc ccaggctatt 300 
actctagtga tctacgtagc tcggcttttt. 360 
ctcagggacc ttgttaacag. tggagtaact 420. 
cactgctgga ggaattttgt. caactaccca 480 
ccacctctgt ggatgatgtt. gtacgcactg. 540 
ccctgtttaa agatttcaag. aagatggcaa 600. 
caaaactgcc attaccaaac gattccgcca 660. 
ccttctgtgg cttggagatg. a. 711 



<210> 13 . 

<211> 229 

<212> PRT ... 

<213> Rattus norvegicus. 

<400> 13 ... 

Met Ser Ser Glu Thr Gly Pro Val. Ala Val. Asp Pro Thr. Leu Arg Arg . 
1 5 10 .... 15 . 

Arg. He Glu Pro His Glu Phe Glu Val Phe. Phe Asp Pro Arg Glu Leu 

20. 25 . .... 30 . . 

Arg. Lys Glu Thr Cys Leu Leu Tyr. Glu He Asn Trp Gly. Gly Arg His 
... 35 40 45. 
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Ser lie Trp Arg His Thr Ser. Gin Asn Thr Asn Lys. His Val Glu Val 

50 55 60 . 

Asn Phe lie Glu Lys Phe Thr Thr Glu Arg Tyr Phe Cys. Pro. Asn Thr. 

65 70. . . 75 80 



. Arg Cys. Ser lie Thr Trp Phe Leu 

85 

. . Ser. Arg Ala lie Thr Glu Phe Leu 

100 .... 

. . Phe lie Tyr lie. Ala Arg Leu Tyr 
.115 . 120 

. . Gin Gly Leu Arg Asp Leu lie Ser. 

... 130 ... 135 . . 

Thr Glu Gin Glu Ser Gly. Tyr. Cys 
.145 150. . . . 

. . Pro Ser Asn Glu Ala His Trp Pro 
165. . 

. . Leu Tyr Val Leu Glu Leu Tyr. Cys 
180 

Leu Asn lie Leu Arg Arg Lys Gin 

. . . 195. . . 200. 

. . . Ala Leu Gin Ser Cys. His Tyr. Gin 
. 210 . . 215 

Ala. Thr. Gly. Leu Lys ... 
225 



. <210>. 14 . . 

. <211> 690 
. <212> DNA . 

. <213> Rattus norvegicus 



Ser Trp Ser. Pro. Cys Gly. Glu Cys . 

90 ... .... 95 

Ser. Arg Tyr. Pro His Val Thr Leu 
105. 110 

His. His Ala Asp Pro Arg. Asn Arg . 

.... 125 . 

Ser. Gly. Val Thr He Gin He Met . 

140 .... 

Trp. Arg Asn Phe Val Asn Tyr Ser 

. . 155 160 ... . 

Arg. Tyr. Pro His. Leu Trp Val Arg 
. 170. 175. . 

He He Leu Gly. Leu Pro Pro. Cys 
185 ... . . . 190 . ... 

Pro. Gin Leu Thr Phe. Phe Thr He. . 

205 

Arg Leu Pro Pro. His. He Leu Trp 
220 . . . .... 



<400> 14 . ... 

atgagttccg agacaggccc tgtagctgtt 

cacgagtttg aagtcttctt tgacccccgg 

gagatcaact ggggaggaag gcacagcatc 

cacgttgaag tcaatttcat agaaaaattt 

agatgctcca ttacctggtt cctgtcctgg 



gatcccactc tgaggagaag. aattgagccc 60 
gaacttcgga aagagacctg. tctgctgtat. 120 
tggcgacaca cgagccaaaa caccaacaaa 180 
actacagaaa gatacttttg. tccaaacacc 240 
agtccctgtg gggagtgctc. cagggccatt 300 
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acagaatttt tgagccgata cccccatgta actctgttta tttatatagc. acggctttat 360 
caccacgcag atcctcgaaa tcggcaagga ctcagggacc ttattagcag cggtgttact 420 
atccagatca tgacggagca agagtctggc tactgctgga ggaattttgt caactactcc 480 
ccttcgaatg aagctcattg gccaaggtac. ccccatctgt gggtgaggct. gtacgtactg 540 
gaactctact gcatcatttt aggacttcca ccctgtttaa atattttaag aagaaaacaa 600 
cctcaactca cgtttttcac gattgctctt. caaagctgcc attaccaaag gctaccaccc 660 
cacatcctgt gggccacagg gttgaaatga . 690 



<210> 15. . . 
<211> 229. 

<212> PRT 

<213> Mus musculus 



<400> 15. . . . .... 

Met Ser. Ser. Glu Thr. Gly Pro Val. Ala Val Asp. Pro Thr. Leu Arg Arg 
1 5. 10 15 

Arg lie Glu Pro His Glu Phe Glu Val Phe Phe Asp Pro Arg Glu Leu 

... . . 20 .25 30 

Arg. Lys Glu Thr Cys Leu Leu Tyr Glu lie Asn Trp Gly Gly Arg His 

35 . . . 40 . 45. . . 

Ser. Val Trp Arg. His Thr Ser Gin Asn Thr Ser Asn His. Val Glu Val. 

50 . . ... 55 60 

Asn Phe Leu Glu Lys Phe Thr Thr. Glu Arg Tyr Phe Arg. Pro. Asn Thr 

65 70 ... . .75 80 

Arg Cys. Ser lie Thr Trp Phe Leu Ser Trp Ser. Pro Cys Gly Glu Cys 

.... 85 . 90 . . . . . . 95 . . . 



Ser Arg Ala lie. Thr Glu Phe Leu Ser Arg His. Pro Tyr Val. Thr. Leu 

100. 105 110 

Phe lie. Tyr lie. Ala Arg Leu Tyr. His His Thr Asp Gin Arg. Asn Arg 

. . 115 120. . . .... 125 

Gin Gly Leu Arg Asp. Leu lie Ser Ser Gly Val Thr lie. Gin lie Met 

. . . 130 135 140 

Thr. Glu Gin Glu Tyr. Cys Tyr Cys Trp Arg Asn Phe Val. Asn Tyr Pro 

145 . . . 150. . . .155 160 

Pro. Ser Asn Glu Ala Tyr Trp Pro Arg Tyr. Pro His Leu Trp. Val Lys. 
165. 170 175 . . 
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Leu Tyr Val Leu Glu Leu Tyr Cys. 
180 

Leu Lys lie Leu Arg Arg. Lys Gin 

195 . . 200 

Thr Leu Gin Thr Cys His. Tyr Gin 

210 ... 215. 

Ala. Thr Gly Leu Lys 

225 .... 



lie lie Leu Gly Leu Pro Pro Cys. 
185 190 

Pro. Gin Leu Thr Phe Phe. Thr. lie 
205 

Arg He Pro Pro His. Leu Leu Trp . 

220 .... 



<210> 16. . . . . 

<211> 690 

<212> DNA 

<213> Mus musculus 



<400> 16 

atgagttccg agacaggccc 
cacgagtttg aagtcttctt 
gagatcaact. ggggtggaag 
cacgttgaag tcaacttctt 
agatgctcca ttacctggtt 
acagagtttc tgagccgaca 
caccacacgg atcagcgaaa 
atccagatca tgacagagca 
ccttcaaacg aagcttattg 
gagctctact. gcatcatttt. 
cctcaactca cgtttttcac. 
catctccttt gggctacagg. 



tgtagctgtt gatcccactc 
tgacccccgg. gagcttcgga 
gcacagtgtc tggcgacaca 
agaaaaattt actacagaaa 
cctgtcctgg agtccctgcg 
cccctatgta actctgttta 
ccgccaagga ctcagggacc 
agagtattgt tactgctgga 
gccaaggtac ccccatctgt 
aggacttcca. ccctgtttaa. 
aattactctt caaacctgcc 
gttgaaatga . . 



tgaggagaag. aattgagccc 60 
aagagacctg tctgctgtat 120 
cgagccaaaa caccagcaac 180 
gatactttcg. tccgaacacc 240 
gggagtgctc cagggccatt. 300 
tttacatagc acggctttat. 360 
ttattagcag cggtgtgact 420 
ggaatttcgt. caactacccc 480. 
gggtgaaact. gtatgtattg. 540. 
aaattttaag. aagaaagcaa 600. 
attaccaaag. gataccaccc 660. 

.... 690 



<210> 17 

<211> 530 

<212>. PRT 

<213> Gallus. gallus 



<400> 17 .... 

Met Ser Lys His. His Asp Ala Gly Thr Ala Phe He Gin Thr Gin Gin 

.1 . . 5 10 . . 15 . . . 

Leu His Ala Ala. Met Ala Asp. Thr Phe Leu Glu His Met Cys Arg Leu 

20 ... .25 30 

Asp He Asp Ser Glu Pro. Thr He. Ala Arg Asn Thr. Gly He. He Cys. 

35 .... .40. ... 45 
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Thr lie Gly Pro Ala Ser Arg Ser. Val. Asp Lys Leu Lys Glu Met lie 

50 55 . ' . . . .60 

Lys Ser Gly Met Asn Val. Ala Arg Leu Asn Phe. Ser His Gly, Thr His 
65 70 75 .80 

Glu Tyr. His Glu Gly Thr lie Lys Asn Val Arg Glu Ala Thr Glu Ser 

... 85 90 95. 

Phe Ala Ser Asp Pro lie Thr Tyr Arg Pro. Val. Ala lie Ala Leu Asp 

100 105 .... 110. . . 

Thr Lys Gly. Pro. Glu lie Arg Thr Gly Leu lie Lys Gly Ser Gly Thr 
115 120 125 

Ala Glu Val Glu Leu Lys. Lys Gly. Ala Ala. Leu Lys Val Thr. Leu Asp. . . 
130 . . . 135 140 

Asn Ala Phe Met. Glu Asn Cys. Asp. Glu Asn Val. Leu Trp Val Asp Tyr 

145 . 150 155 160 

Lys Asn Leu lie Lys Val lie Asp. Val. Gly Ser Lys. lie Tyr Val Asp 

165 .... 170 175 

Asp Gly Leu lie Ser Leu Leu Val. Lys. Glu Lys Gly. Lys. Asp. Phe Val 

. . 180 185 190 . . . 

Met Thr. Glu Val Glu Asn Gly Gly Met Leu Gly Ser Lys Lys. Gly Val 

... 195 200 205 

Asn Leu Pro Gly Ala. Ala Val Asp. Leu Pro. Ala Val. Ser Glu Lys Asp . . 

. 210 .... 215 220 

lie Gin Asp Leu Lys. Phe Gly Val. Glu Gin Asn Val Asp Met. Val Phe . 
225 . 230. . . 235. ... . 240. 

Ala Ser Phe lie Arg. Lys. Ala. Ala. Asp Val. His Ala Val Arg Lys. Val. . 

245 .... 250 255 

Leu Gly Glu Lys. Gly Lys His. lie. Lys lie He Ser. Lys He Glu Asn 

260 265 . . .270 

His Glu Gly Val Arg Arg Phe Asp. Glu He Met Glu Ala. Ser Asp Gly 

275 . 280 . ... 285 

He Met. Val Ala Arg Gly Asp. Leu Gly He. Glu. He. Pro. Ala. Glu Lys 

290 295 300 
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Val. Phe Leu Ala Gin Lys Met Met lie Gly Arg. Cys Asn Arg Ala Gly 
305 310. 315 ... : 320 

Lys Pro lie lie Cys Ala Thr. Gin Met Leu Glu Ser Met lie Lys Lys 

325 330 . . 335 

Pro Arg Pro Thr Arg Ala Glu Gly Ser Asp Val Ala Asn Ala Val. Leu 
340 345 ... 350 . 

Asp Gly Ala Asp Cys lie Met Leu Ser Gly Glu Thr. Ala Lys. Gly. Asp. 
355 360 365 

Tyr Pro Leu Glu Ala Val Arg Met Gin His Ala He Ala Arg. Glu Ala 
370 375 .380 

Glu Ala Ala Met Phe His Arg. Gin Gin Phe Glu Glu He Leu Arg His 
385 . . 390 395 400 

Ser Val His His Arg Glu Pro Ala Asp Ala Met Ala. Ala. Gly. Ala. Val 
405 . . .410 415. . 

Glu Ala Ser. Phe Lys Cys Leu Ala Ala Ala Leu He Val Met Thr Glu 
... . 420. . 425 430 

Ser. Gly Arg Ser Ala. His. Leu Val. Ser. Arg. Tyr. Arg. Pro. Arg Ala Pro 
435 ... 440 445 

He. He Ala Val. Thr. Arg Asn Asp. Gin Thr. Ala Arg Gin Ala His Leu 

. . . 450 455 460 . 



Tyr. Arg. Gly. Val. Phe Pro. Val Leu Cys Lys Gin Pro. Ala His Asp Ala 
465 . . 470 . 475 480 

Trp Ala Glu Asp. Val. Asp. Leu Arg. Val. Asn Leu Gly Met Asn Val. Gly . . 
.. .. 485 490 495. . . 

Lys Ala Arg Gly Phe Phe Lys Thr Gly Asp Leu Val He Val. Leu Thr 
500. . . . . 505. . 510 . 

Gly. Trp Arg. Pro Gly Ser Gly Tyr. Thr Asn Thr. Met. Arg. Val Val. Pro 
515 520 525 

Val. Pro .... 

. . . 530 



<210> 18 
<211> 1593 
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<212>. DNA 

<213> Gallus gallus. 

<400> 18 

atgtcgaagc accacgatgc agggaccgct ttcatccaga cccagcagct. gcacgctgcc 60 

atggcagaca cctttctgga gcacatgtgc cgcctggaca tcgactccga gccaaccatt 120 

gccagaaaca ccggcatcat ctgcaccatc ggcccagcct cccgctctgt. ggacaagctg 180 

aaggaaatga ttaaatctgg aatgaatgtt gcccgcctca acttctcgca cggcacccac 240. 

gagtatcatg agggcacaat taagaacgtg cgagaggcca cagagagctt. tgcctctgac 300 

ccgatcacct acagacctgt ggctattgca ctggacacca agggacctga aatccgaact 360 . 

ggactcatca agggaagtgg cacagcagag gtggagctca agaagggcgc agctctcaaa 420 

gtgacgctgg acaatgcctt. catggagaac tgcgatgaga atgtgctgtg. ggtggactac 480 

aagaacctca tcaaagttat agatgtgggc agcaaaatct atgtggatga. cggtctcatt. 540 

tccttgctgg ttaaggagaa aggcaaggac tttgtcatga ctgaggttga gaacggtggc 600 

atgcttggta gtaagaaggg agtgaacctc ccaggtgctg cggtcgacct gcctgcagtc 660 

tcagagaagg. acattcagga. cctgaaattt ggcgtggagc agaatgtgga. catggtgttc 720 

gcttccttca tccgcaaagc. tgctgatgtc catgctgtca ggaaggtgct. aggggaaaag 780 

ggaaagcaca tcaagattat. cagcaagatt gagaatcacg. agggtgtgcg. caggtttgat 840 

gagatcatgg. aggccagcga tggcattatg gtggcccgtg gtgacctggg. tattgagatc 900. 

cctgctgaaa aagtcttcct cgcacagaag atgatgattg. ggcgctgcaa cagggctggc 960 

aaacccatca tttgtgccac tcagatgttg gaaagcatga tcaagaaacc tcgcccgacc 1020 

cgcgctgagg gcagtgatgt tgccaatgca gttctggatg gagcagactg catcatgctg 1080. 

tctggggaga ccgccaaggg agactaccca ctggaggctg tgcgcatgca. gcacgctatt 1140. 

gctcgtgagg ctgaggccgc aatgttccat cgtcagcagt ttgaagaaat cttacgccac 1200 

agtgtacacc. acagggagcc tgctgatgcc atggcagcag gcgcggtgga ggcctccttt 1260 

aagtgcttag cagcagctct. gatagttatg accgagtctg gcaggtctgc. acacctggtg 1320 

tcccggtacc gcccgcgggc. tcccatcatc gccgtcaccc gcaatgacca. aacagcacgc 1380 

caggcacacc tgtaccgcgg. cgtcttcccc gtgctgtgca agcagccggc. ccacgatgcc 1440 

tgggcagagg atgtggatct ccgtgtgaac ctgggcatga atgtcggcaa. agcccgtgga 1500 

ttcttcaaga. ccggggacct ggtgatcgtg ctgacgggct ggcgccccgg ctccggctac 1560 

accaacacca. tgcgggtggt gcccgtgcca tga 1593 

<210> 19 

<211> 9 

<212> PRT 

<213> Artificial. Sequence 

<220> . . 

<223> Description of Artificial Sequence: . hemagglutinin 

. epitope tag .... .... 

<400> 19 ... 

Tyr. Pro Tyr Asp. Val Pro. Asp. Tyr. Ala 

. . I. . .5 



<210> 20 
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<211> 27 
<212> DNA 

<213> Artificial Sequence ... 

<220> .... 

<223> Description of Artificial Sequence: encodes 

hemagglutinin epitope tag ... 

<400> 20 . 

tacccctacg acgtgcccga ctacgcc .27 



<210> 21 . 

<211> 594. ... 
<212> PRT 

<213> Rattus norvegicus. 



<400> 21 ... .... 

Met Glu Ser Asn His. Lys. Ser Gly. Asp. Gly Leu Ser. Gly Thr Gin Lys 

. 1 5 10 .... . 15. . 



Glu Ala Ala Leu Arg Ala Leu Val. 

20 

Gin Glu Asn Gly Gin Arg. Lys Tyr. 

35 40 

Thr Thr Pro Pro. Glu Arg. Gly. Cys 
50 55. 

Arg. Asp Leu Phe. Glu Asp Glu Leu 

.65 .... 70 

Lys He Tyr Glu Met Arg. Met. Met 
85 



Gin Arg Thr Gly Tyr Ser. Leu Val 
25 30. . . 

Gly Gly Pro. Pro Pro Gly. Trp Asp 

. 45. . . 

Glu lie Phe He Gly. Lys Leu Pro. 

60 

He Pro Leu Cys Glu Lys He Gly 

75. 80 

Met. Asp Phe Asn Gly. Asn Asn Arg. 
90 . 95 ... . 



Gly Tyr Ala Phe. Val Thr. Phe Ser 

. . 100 ... 

He Lys Gin Leu Asn Asn Tyr Glu 

115 120 

Val. Cys Ala Ser Val. Asp Asn Cys 
130 ... 135 

Lys Thr Lys Lys Arg. Glu Glu He. 
145 . 150 . . 



Asn Lys. Gin Glu Ala. Lys Asn Ala . . 
105. ... . . 110 . . . 

He Arg. Asn Gly. Arg Leu Leu Gly . . 
. . . 125 

Arg Leu Phe Val Gly. Gly. He. Pro. 
... 140 

Leu Ser Glu Met Lys. Lys Val Thr 
155 . 160 



26 



WO 02/068676 



PCT/US02/05824 



Glu Gly Val Val Asp Val lie Val 
165 

Lys Asn Arg Gly Phe Ala Phe. Val 
180 

Ala Met Ala Arg Arg Arg Leu Leu 

195 . . 200 

His Pro lie Ala Val Asp Trp. Ala. 

210 215. . . . 



Tyr Pro Ser Ala Ala Asp Lys Thr 
170 175 

Glu Tyr Glu Ser His Arg Ala Ala 
185 ... 190. 

Pro Gly Arg lie. Gin Leu Trp Gly. 
... 205 . 

Glu Pro Glu Val Glu Val Asp Glu 
220 



Asp Thr. Met Ser Ser Val. Lys. lie Leu Tyr Val Arg Asn Leu Met Leu 
225 230 235 240. 

Ser Thr Ser Glu Glu Met lie Glu Lys Glu Phe Asn Ser lie Lys Pro 

. 245 . 250 255 

Gly. Ala Val Glu Arg Val Lys Lys lie Arg Asp Tyr. Ala Phe Val. His 
260 265 270 

Phe Ser Asn Arg Glu Asp. Ala. Val Glu Ala. Met Lys Ala Leu Asn Gly 
275 280 ... 285 

Lys Val Leu Asp Gly Ser Pro lie Glu Val. Thr Leu Ala Lys Pro Val 

290 295 300 . . 

Asp. Lys Asp. Ser Tyr Val. Arg Tyr. Thr. Arg Gly Thr. Gly Gly. Arg Asn 
305 310 . . 315. . . 320 

Thr. Met Leu Gin Glu Tyr Thr Tyr. Pro. Leu Ser. His Val. Tyr Asp Pro. 

. 325 330 .. . 335 

Thr Thr Thr Tyr Leu Gly Ala Pro Val. Phe Tyr Thr. Pro Gin Ala Tyr. 

340 . . ... 345. . . .... 350 . 

Ala Ala lie Pro Ser. Leu His. Phe Pro Ala Thr Lys. Gly His Leu Ser 

355 . . 360 . ... 365 

Asn Arg Ala Leu lie Arg. Thr. Pro Ser Val. Arg. Glu lie Tyr Met Asn 

370 375. . 380 

Val Pro Val Gly Ala Ala. Gly Val Arg Gly. Leu Gly Gly Arg Gly Tyr 
385 . . 390 . 395 400 

Leu Ala Tyr Thr Gly Leu Gly Arg Gly Tyr Gin Val Lys Gly Asp Lys 
405 . . 410. 415 
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Arg Gin Asp Lys Leu Tyr Asp Leu 
420 

Met Asn Thr lie Ser Leu Lys. Pro 
435 . . 440 

lie Leu Glu Glu lie Cys Gin Lys 

. 450 455 . . 

Gin Leu His Ser Ala lie Gly Gin 
465. . 470 



Leu Pro. Gly Met Glu Leu Thr Pro 

425 . 430 

Gin Gly. Val Lys. Leu Ala Pro Gin 
445 

Asn Asn Trp. Gly Gin Pro. Val. Tyr 
.460 .... 

Asp Gin Arg. Gin Leu Phe Leu Tyr. 

. 475 . . 480 



Lys Val Thr He Pro Ala Leu Ala Ser Gin Asn. Pro Ala He His. Pro 
485 . 490 495 . . 

Phe Thr. Pro Pro. Lys Leu Ser. Ala Tyr. Val Asp. Glu Ala Lys Arg. Tyr 
500 . . . 505 510 . 

. Ala Ala Glu His Thr. Leu Gin Thr Leu Gly He Pro Thr. Glu Gly Gly 
515 .... 520 525 

. Asp. Ala Gly Thr Thr Ala. Pro Thr Ala Thr Ser Ala Thr. Val. Phe Pro 
.530 . . 535 540 

Gly Tyr Ala Val Pro. Ser Ala Thr /Via. Pro. Val Ser Thr Ala Gin Leu . 
545 550 555 560 

Lys Gin Ala Val Thr Leu Gly Gin Asp. Leu Ala Ala Tyr Thr Thr Tyr 

. . . . 565. . . 570 575 . . 

. . . . Glu Val Tyr Pro Thr. Phe Ala Val Thr Thr. Arg Gly. Asp Gly Tyr Gly . . . 
. . 580. 585. . . .... 590 

. . Thr. Phe 



<210> 22 

<211>. 1785 .... 

. <212> DNA 

. <213> Rattus norvegicus. 

. <400> 22 

. atggaatcaa atcacaaatc. cggggatgga 
cgcgcactgg tccagcgcac. aggatatagc 
■ ggtggtcctc caccaggctg. ggatactaca. 
. gggaaacttc cccgggacct. ttttgaggat. 
. aaaatttatg aaatgagaat gatgatggat. 



ttgagcggca cccagaagga. agcagcactc 60 
ttggtccagg. aaaatggaca aagaaaatat 120. 
cccccagaaa ggggctgcga. gattttcatt 180 
gaactcatac cattgtgtga. aaaaattggt 240 
ttcaatggga acaacagagg ctatgcattt 300 
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gtaaccttct caaataagca. ggaagccaag aatgcaatca agcaacttaa taattatgaa 360. 
attcggaatg gccgtctcct gggcgtctgt gccagtgtgg acaactgccg. gttgtttgtg 420 
gggggaatcc ccaaaaccaa aaagagagaa gaaatcttgt cagagatgaa aaaggtcact 480 
gaaggagttg ttgatgtcat tgtctaccca agcgctgccg. ataaaaccaa aaaccggggg 540 
tttgcctttg tggaatatga gagtcaccgc gcagccgcca tggctaggcg gaggctgctg 600 . 
ccaggaagaa ttcagttgtg gggacatcct atcgcagtag actgggcaga gccagaagtc 660 
gaagttgacg aagacacaat gtcttccgtg aaaatcctgt acgtaaggaa ccttatgctg 720 
tctacctcgg. aagagatgat tgagaaggaa ttcaacagta ttaaaccagg tgctgtggaa 780 
cgggtgaaga. agatccgaga ctatgctttt gtgcatttca gtaaccgaga agatgcagtt 840 
gaagccatga. aggctttgaa tggcaaggtg ctggatggtt ccccaataga agtgaccttg 900 
gccaagccag tggacaagga cagttacgtt aggtacaccc ggggcaccgg gggcaggaac 960 
accatgctgc aagaatacac ctaccctctg agccatgttt atgaccctac cacaacctac 1020 
cttggagctc. ctgtcttcta tactccccaa gcctacgcag ccattccaag tcttcatttc 1080 
ccagctacca aaggacatct cagcaacaga gctctcatcc ggaccccttc tgtcagagaa 1140. 
atttacatga atgtccctgt aggggctgcg ggcgtgagag gactgggcgg ccgtgggtat. 1200 
ttggcatata caggcctggg tcgaggatac caggtcaaag. gagacaagag acaagacaaa 1260 
ctctatgacc ttctgcctgg gatggagctc accccgatga atactatctc tttaaaacca. 1320 
caaggagtta aacttgctcc tcagatatta gaagaaatct gtcagaaaaa taactgggga 1380 
cagccagtgt accagctgca ctctgccatt ggacaagacc aaagacagtt attcctatac 1440 
aaagtaacta tcccagcgct ggccagccag aatcctgcga tccacccttt cacaccccca 1500. 
aagctaagcg cctacgtgga tgaagcaaag aggtacgccg. cagagcacac cctacagaca 1560. 
ctaggcatcc ccacagaagg aggggacgct gggactacag. cacccactgc cacatccgcc 1620 
actgtgtttc. caggatacgc tgtccccagt gccaccgctc. ctgtgtctac agcccagctc 1680. 
aagcaagcag. tgacacttgg acaagactta gcagcatata caacctatga ggtctaccct 1740 
acttttgcag. tgaccacccg aggtgatgga tatggcacct tctga . ... 1785. 



<210> 23 

<211> 586 . . 

<212> PRT . 

<213>. Homo, sapiens 



<400> 23 ... 

Met Glu Ser Asn His. Lys. Ser Gly Asp Gly. Leu Ser. Gly Thr Gin Lys. . 
■ 1 ■ ■ 5 . . . 10 15 

Glu Ala Ala Leu Arg. Ala. Leu Val Gin Arg Thr Gly. Tyr Ser Leu Val 

• . 20 .25 30 

Gin Glu Asn Gly Gin Arg Lys. Tyr. Gly Gly. Pro. Pro. Pro. Gly Trp. Asp 
35. . . .40 . . 45 . . 

Ala Ala Pro Pro Glu Arg Gly. Cys Glu He Phe He Gly Lys. Leu Pro 
. . 50 55. . 60 

Arg Asp Leu Phe. Glu Asp Glu Leu He Pro. Leu Cys. Glu Lys. He Gly 
65 .... 70. . .75 ... ... 80 
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Lys lie Tyr Glu Met Arg Met Met Met Asp Phe Asn Gly Asn Asn Arg 
85. 90 95 

Gly Tyr Ala Phe. Val. Thr. Phe Ser Asn Lys Val Glu Ala Lys Asn Ala 
100 105 . 110 

He Lys Gin Leu Asn Asn Tyr Glu lie Arg Asn Gly Arg Leu Leu Gly 

115 120 125 

Val. Cys Ala Ser. Val Asp. Asn Cys Arg. Leu Phe Val Gly. Gly He Pro. 

130 135 140 .... 



Lys Thr Lys Lys Arg. Glu Glu He Leu Ser Glu Met Lys Lys. Val Thr. 
145 150 155 160. 

Glu Gly. Val Val Asp. Val. He Val Tyr Pro. Ser Ala Ala Asp. Lys Thr. 

165 170 .175 

Lys Asn Arg Gly. Phe Ala Phe Val. Glu Tyr Glu Ser. His Arg Ala Ala. 

. . 180. . ... 185. 190 



Ala Met Ala Arg. Arg Lys Leu Leu Pro Gly Arg lie. Gin Leu Trp. Gly 
195 . . 200 205 

His Gly. He. Ala Val Asp Trp. Ala Glu Pro. Glu Val Glu Val. Asp Glu 

. . 210 215 . 220 

Asp. Thr. Met Ser Ser. Val. Lys. He Leu Tyr Val. Arg Asn Leu Met Leu 

225. . . 230. . . 235. 240 

Ser Thr. Ser. Glu Glu Met He. Glu Lys Glu Phe. Asn Asn He. Lys Pro 
245 . . . 250. ... .... 255. 

Gly. Ala Val Glu Arg Val. Lys. Lys He Arg Asp. Tyr Ala. Phe Val His. 

. 260 265 . .... 270 . 

Phe Ser Asn Arg Lys. Asp. Ala Val Glu Ala. Met Lys Ala Leu Asn Gly 

. 275 280 . 285 

Lys. Val Leu Asp. Gly. Ser. Pro He. Glu Val Thr Leu Ala Lys. Pro Val. 
. 290 295 . . . 300 

Asp Lys Asp. Ser. Tyr Val Arg. Tyr Thr Arg Gly. Thr Gly. Gly. Arg. Gly. 

305 310 ... . 315 . . 320. 

Thr Met Leu Gin Gly. Glu Tyr. Thr Tyr. Ser. Leu Gly Gin Val Tyr. Asp 

325 330 . . 335. 
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Pro Thr Thr Thr Tyr. Leu Gly Ala Pro Val Phe Tyr Ala Pro Gin Thr 
340 345 350 

Tyr Ala Ala lie. Pro Ser Leu His Phe Pro Ala Thr Lys Gly His Leu 

355 .... 360 365 

Ser Asn Arg. Ala lie lie Arg. Ala Pro Ser. Val Arg Gly. Ala Ala Gly . 
.370 . 375. ... ... 380. 

Val. Arg Gly. Leu Gly Gly. Arg Gly Tyr Leu Ala Tyr Thr Gly. Leu Gly .... 
385 ... 390 395. . . . . 400 

Arg Gly Tyr Gin Val. Lys Gly. Asp Lys Arg. Glu Asp Lys Leu Tyr Asp 

. . 405 410 415 .... 

lie. Leu Pro Gly Met Glu Leu Thr. Pro. Met Asn Pro Val Thr. Leu Lys 

. . 420 . . 425 430 . .... 

Pro Gin Gly lie Lys. Leu Ala. Pro. Gin lie. Leu Glu Glu lie Cys Gin . . 
435 ... .440 445 . . . 

Lys Asn Asn Trp Gly Gin Pro Val Tyr Gin Leu His Ser. Ala lie Gly. . . 
450 ... 455 460 

Gin Asp Gin Arg Gin Leu Phe Leu Tyr. Lys. He Thr. He Pro Ala Leu 

465. . . . 470 475. . .... 480 

Ala Ser Gin Asn Pro. Ala He His. Pro Phe Thr. Pro Pro Lys Leu Ser ... 
485. ... .490 .... 495 

Ala. Phe Val Asp Glu Ala. Lys Thr Tyr. Ala Ala Glu Tyr Thr Leu Gin . . . 

500 505. ... .... 510 

Thr Leu Gly He Pro. Thr Asp Gly. Gly Asp Gly. Thr Met Ala Thr Ala . ... 

515 520 525 

Ala Ala Ala Ala Thr. Ala Phe Pro Gly Tyr Ala. Val Pro Asn Ala Thr 

530 535 ... .... 540 . ... .... 

Ala Pro Val Ser. Ala. Ala. Gin Leu Lys. Gin Ala Val. Thr. Leu Gly Gin . . . 
545 550. ... .... 555 .... 560. 

Asp Leu Ala Ala Tyr. Thr. Thr Tyr Glu Val. Tyr. Pro. Thr Phe Ala Val. . 

. 565 .570 575 

Thr Ala Arg Gly. Asp. Gly Tyr. Gly Thr Phe 

.580 585. . .... 
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<210> 24 

. <211> 1761 

<212> DNA 

<213> Homo sapiens 

<400> 24 

atggaatcaa atcacaaatc cggggatgga ttgagcggca ctcagaagga agcagccctc 60 
cgcgcactgg tccagcgcac aggatatagc ttggtccagg aaaatggaca aagaaaatat 120 

. . ggtggccctc cacctggttg ggatgctgca ccccctgaaa ggggctgtga aatttttatt 180. 
ggaaaacttc cccgagacct ttttgaggat gagcttatac cattatgtga. aaaaatcggt 240 
aaaatttatg aaatgagaat gatgatggat tttaatggca acaatagagg atatgcattt 300 
gtaacatttt caaataaagt ggaagccaag aatgcaatca agcaacttaa taattatgaa 360 
attagaaatg ggcgcctctt aggggtttgt gccagtgtgg acaactgccg. attatttgtt 420 
. gggggcatcc caaaaaccaa aaagagagaa gaaatcttat cggagatgaa. aaaggttact 480 
gaaggtgttg tcgatgtcat cgtctaccca agcgctgcag. ataaaaccaa. aaaccgaggc. 540 
tttgccttcg tggagtatga gagtcatcga gcagctgcca. tggcgaggag. gaaactgcta 600 
ccaggaagaa ttcagttatg gggacatggt attgcagtag. actgggcaga. gccagaagta 660 
gaagttgatg. aagatacaat gtcttcagtg aaaatcctat atgtaagaaa tcttatgctg 720. 
tctacctctg aagagatgat tgaaaaggaa ttcaacaata tcaaaccagg tgctgtggag 780 
agggtgaaga. aaattcgaga. ctatgctttt gtgcacttca gtaaccgaaa agatgcagtt 840 
. gaggctatga aagctttaaa. tggcaaggtg ctggatggtt cccccattga agtcacccta 900 
gcaaaaccag tggacaagga cagttatgtt aggtataccc gaggcacagg tggaaggggc 960 
accatgctgc aaggagagta. tacctactct. ttgggccaag. tttatgatcc caccacaacc 1020 
taccttggag ctcctgtctt. ctatgccccc. cagacctatg. cagcaattcc cagtcttcat 1080. . 

. . ttcccagcca ccaaaggaca tctcagcaac agagccatta. tccgagcccc ttctgttaga 1140. . 

. . ggggctgcgg gagtgagagg. actgggcggc cgtggctatt tggcatacac aggcctgggt 1200. 

. . cgaggatacc aggtcaaagg agacaaaaga gaagacaaac tctatgacat. tttacctggg 1260. . 

. . atggagctca ccccaatgaa tcctgtcaca ttaaaacccc aaggaattaa actcgctccc 1320. . 

. . cagatattag aagagatttg tcagaaaaat. aactggggac agccagtgta ccagctgcac. 1380. . 
tctgctattg gacaagacca aagacagcta. ttcttgtaca. aaataactat tcctgctcta 1440. 
gccagccaga atcctgcaat ccaccctttc. acacctccaa agctgagtgc ctttgtggat 1500 

. . gaagcaaaga cgtatgcagc cgaatacacc ctgcagaccc tgggcatccc. cactgatgga 1560 . 
ggcgatggca ccatggctac tgctgctgct gctgctactg. ctttcccagg. atatgctgtc 1620. 

. . cctaatgcaa ctgcacccgt gtctgcagcc cagctcaagc. aagcggtaac ccttggacaa 1680 

. . gacttagcag catatacaac ctatgaggtc tacccaactt. ttgcagtgac tgcccgaggg 1740. 

. . gatggatatg gcaccttctg a 1761. . 

. <210> 25 

<211> 45 

<212> DNA 

<213> Artificial. Sequence 

<220> 

<223> Description of Artificial. Sequence:, oligomer 

encoding. TAT protein transduction domain 
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<400>. 25 

catatgggaa gaaaaaaaag aagacaaaga agaagaggcc tcgag '45 

<210> 26 

<211> 2274 . . . 

<212> DNA 

<213> Artificial Sequence. 

. <220> 

<223> Description of Artificial Sequence: 

HA-rAPOBEC-CMPK construct 

<400> 26 

atgggctcta gataccccta cgacgtgccc gactacgccg. atatcagttc. cgagacaggc 60 
cctgtagctg ttgatcccac tctgaggaga agaattgagc cccacgagtt. tgaagtcttc. 120 
tttgaccccc gggaacttcg gaaagagacc tgtctgctgt atgagatcaa. ctggggagga 180 
aggcacagca tctggcgaca cacgagccaa aacaccaaca aacacgttga agtcaatttc. 240 . 
atagaaaaat ttactacaga. aagatacttt tgtccaaaca ccagatgctc cattacctgg 300 
ttcctgtcct ggagtccctg. tggggagtgc. tccagggcca. ttacagaatt tttgagccga 360 
tacccccatg taactctgtt. tatttatata gcacggcttt. atcaccacgc agatcctcga 420. 
aatcggcaag gactcaggga ccttattagc agcggtgtta ctatccagat catgacggag 480 . 
caagagtctg gctactgctg gaggaatttt gtcaactact ccccttcgaa tgaagctcat 540 
tggccaaggt acccccatct gtgggtgagg ctgtacgtac tggaactcta ctgcatcatt 600. 
ttaggacttc caccctgttt aaatatttta agaagaaaac aacctcaact cacgtttttc 660. 
acgattgctc ttcaaagctg ccattaccaa aggctaccac cccacatcct. gtgggccaca 720 
gggttgaaag aattccacgc tgccatggca gacacctttc tggagcacat. gtgccgcctg. 780 
gacatcgact. ccgagccaac cattgccaga aacaccggca. tcatctgcac catcggccca 840 
gcctcccgct ctgtggacaa. gctgaaggaa atgattaaat ctggaatgaa. tgttgcccgc. 900 
ctcaacttct cgcacggcac ccacgagtat catgagggca caattaagaa. cgtgcgagag 960 
gccacagaga. gctttgcctc tgacccgatc acctacagac ctgtggctat tgcactggac 1020. 
accaagggac ctgaaatccg aactggactc* atcaagggaa gtggcacagc agaggtggag 1080 
ctcaagaagg. gcgcagctct caaagtgacg ctggacaatg. ccttcatgga gaactgcgat 1140 
gagaatgtgc tgtgggtgga ctacaagaac ctcatcaaag. ttatagatgt. gggcagcaaa. 1200. 
atctatgtgg. atgacggtct catttccttg ctggttaagg. agaaaggcaa. ggactttgtc 1260 
atgactgagg ttgagaacgg tggcatgctt ggtagtaaga. agggagtgaa. cctcccaggt 1320. 
gctgcggtcg acctgcctgc agtctcagag aaggacattc aggacctgaa. atttggcgtg 1380 
gagcagaatg tggacatggt gttcgcttcc ttcatccgca. aagctgctga. tgtccatgct 1440 
gtcaggaagg tgctagggga. aaagggaaag cacatcaaga ttatcagcaa gattgagaat 1500 
cacgagggtg tgcgcaggtt. tgatgagatc atggaggcca. gcgatggcat tatggtggcc 1560 
cgtggtgacc tgggtattga. gatccctgct gaaaaagtct tcctcgcaca. gaagatgatg 1620 
attgggcgct gcaacagggc tggcaaaccc atcatttgtg ccactcagat gttggaaagc. 1680 
atgatcaaga aacctcgccc. gacccgcgct gagggcagtg atgttgccaa tgcagttctg 1740. 
gatggagcag actgcatcat gctgtctggg gagaccgcca agggagacta cccactggag 1800 
gctgtgcgca tgcagcacgc tattgctcgt gaggctgagg ccgcaatgtt ccatcgtcag 1860 
cagtttgaag aaatcttacg ccacagtgta caccacaggg. agcctgctga tgccatggca 1920. 
gcaggcgcgg tggaggcctc. ctttaagtgc ttagcagcag. ctctgatagt tatgaccgag. 1980 
tctggcaggt ctgcacacct. ggtgtcccgg taccgcccgc. gggctcccat catcgccgtc 2040 
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acccgcaatg accaaacagc acgccaggca cacctgtacc gcggcgtctt. ccccgtgctg 2100 
tgcaagcagc cggcccacga tgcctgggca gaggatgtgg atctccgtgt gaacctgggc 2160 
atgaatgtcg. gcaaagcccg tggattcttc aagaccgggg acctggtgat cgtgctgacg 2220 
ggctggcgcc ccggctccgg ctacaccaac accatgcggg tggtgcccgt gcca 2274 

<210> 27 .... 

<211> 1590 

<212> DNA 

<213> Artificial. Sequence 

<220> 

<223> Description of. Artificial Sequence: HA-CMPK 
construct 

<400>. 27 

ctcgagatgt. acccctacga cgtgcccgac tacgccgata tccacgctgc catggcagac 60. . 
acctttctgg. agcacatgtg ccgcctggac atcgactccg agccaaccat tgccagaaac 120. 
accggcatca tctgcaccat. cggcccagcc tcccgctctg tggacaagct gaaggaaatg. 180 
attaaatctg gaatgaatgt. tgcccgcctc. aacttctcgc. acggcaccca cgagtatcat 240 
gagggcacaa ttaagaacgt. gcgagaggcc acagagagct ttgcctctga cccgatcacc 300 
tacagacctg tggctattgc actggacacc aagggacctg aaatccgaac. tggactcatc 360. 
aagggaagtg gcacagcaga ggtggagctc aagaagggcg cagctctcaa agtgacgctg. 420 
gacaatgcct tcatggagaa ctgcgatgag aatgtgctgt gggtggacta caagaacctc 480. 
atcaaagtta tagatgtggg. cagcaaaatc tatgtggatg acggtctcat. ttccttgctg. 540. 
gttaaggaga aaggcaagga ctttgtcatg actgaggttg agaacggtgg catgcttggt 600. . 
agtaagaagg gagtgaacct cccaggtgct gcggtcgacc tgcctgcagt ctcagagaag. 660. . 
gacattcagg. acctgaaatt tggcgtggag cagaatgtgg. acatggtgtt cgcttccttc. 720 . 
atccgcaaag ctgctgatgt. ccatgctgtc aggaaggtgc taggggaaaa gggaaagcac 780 . 
atcaagatta tcagcaagat tgagaatcac gagggtgtgc gcaggtttga tgagatcatg 840 . 
gaggccagcg atggcattat ggtggcccgt ggtgacctgg gtattgagat. ccctgctgaa 900. . 
aaagtcttcc tcgcacagaa gatgatgatt gggcgctgca. acagggctgg caaacccatc 960. . 
atttgtgcca ctcagatgtt ggaaagcatg atcaagaaac ctcgcccgac ccgcgctgag. 1020. 
ggcagtgatg ttgccaatgc agttctggat ggagcagact gcatcatgct gtctggggag. 1080 
accgccaagg gagactaccc actggaggct gtgcgcatgc agcacgctat tgctcgtgag 1140. 
gctgaggccg caatgttcca tcgtcagcag tttgaagaaa tcttacgcca cagtgtacac 1200. 
cacagggagc ctgctgatgc catggcagca ggcgcggtgg aggcctcctt taagtgctta. 1260. 
gcagcagctc tgatagttat gaccgagtct ggcaggtctg cacacctggt gtcccggtac 1320. 
cgcccgcggg ctcccatcat cgccgtcacc cgcaatgacc aaacagcacg. ccaggcacac 1380 
ctgtaccgcg gcgtcttccc cgtgctgtgc aagcagccgg. cccacgatgc ctgggcagag 1440 
gatgtggatc tccgtgtgaa cctgggcatg aatgtcggca. aagcccgtgg. attcttcaag 1500 
accggggacc tggtgatcgt. gctgacgggc tggcgccccg gctccggcta caccaacacc 1560 
atgcgggtgg tgcccgtgcc atgactcgag 1590 

<210> 28 

<211> 1629 ... 

<212> DNA 
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<213> Artificial Sequence ... 
<220> 

<223> Description of Artificial Sequence: TAT-HA-CMPK .. . 

construct . . . . 

<400> 28 ... ... 

catatgggaa gaaaaaaaag aagacaaaga agaagaggcc tcgagatgta cccctacgac 60 . 

gtgcccgact acgccgatat ccacgctgcc atggcagaca cctttctgga gcacatgtgc. 120. . 

cgcctggaca tcgactccga gccaaccatt gccagaaaca ccggcatcat ctgcaccatc. 180. . 

ggcccagcct cccgctctgt ggacaagctg aaggaaatga ttaaatctgg aatgaatgtt 240 . 

gcccgcctca acttctcgca cggcacccac gagtatcatg agggcacaat taagaacgtg. 300 

cgagaggcca cagagagctt tgcctctgac ccgatcacct acagacctgt ggctattgca. 360 . 

ctggacacca agggacctga aatccgaact ggactcatca agggaagtgg cacagcagag. 420 . 

gtggagctca agaagggcgc agctctcaaa gtgacgctgg acaatgcctt catggagaac. 480. . 

tgcgatgaga atgtgctgtg ggtggactac aagaacctca tcaaagttat agatgtgggc 540 . 

agcaaaatct atgtggatga cggtctcatt tccttgctgg ttaaggagaa aggcaaggac 600 

tttgtcatga. ctgaggttga gaacggtggc atgcttggta. gtaagaaggg agtgaacctc 660. . 

ccaggtgctg cggtcgacct gcctgcagtc tcagagaagg acattcagga cctgaaattt 720 

ggcgtggagc agaatgtgga catggtgttc gcttccttca tccgcaaagc tgctgatgtc 780 

catgctgtca ggaaggtgct aggggaaaag ggaaagcaca tcaagattat cagcaagatt. 840 

gagaatcacg agggtgtgcg caggtttgat gagatcatgg aggccagcga. tggcattatg. 900. 

gtggcccgtg gtgacctggg. tattgagatc cctgctgaaa aagtcttcct cgcacagaag. 960 

atgatgattg ggcgctgcaa cagggctggc aaacccatca tttgtgccac tcagatgttg 1020. 

gaaagcatga tcaagaaacc tcgcccgacc cgcgctgagg gcagtgatgt tgccaatgca 1080 

gttctggatg gagcagactg. catcatgctg tctggggaga ccgccaaggg. agactaccca 1140 

ctggaggctg tgcgcatgca gcacgctatt gctcgtgagg ctgaggccgc aatgttccat 1200 

cgtcagcagt ttgaagaaat cttacgccac agtgtacacc acagggagcc tgctgatgcc 1260 

atggcagcag gcgcggtgga. ggcctccttt aagtgcttag cagcagctct. gatagttatg 1320 

accgagtctg gcaggtctgc acacctggtg. tcccggtacc gcccgcgggc. tcccatcatc. 1380 

gccgtcaccc gcaatgacca. aacagcacgc caggcacacc tgtaccgcgg. cgtcttcccc 1440 

gtgctgtgca agcagccggc ccacgatgcc tgggcagagg atgtggatct ccgtgtgaac 1500 

ctgggcatga. atgtcggcaa agcccgtgga ttcttcaaga ccggggacct ggtgatcgtg 1560 

ctgacgggct. ggcgccccgg ctccggctac accaacacca. tgcgggtggt. gcccgtgcca 1620 

tgactcgag 1629 

<210> 29 

<211> 23 . 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial. Sequence:. . primer ND1 

<400> 29 

atctgactgg gagagacaag tag 23 
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<210> 30 ... 

<211> 23 

<212> DNA ... 
<213> Artificial Sequence ... 

<220> 

<223> Description of Artificial. Sequence:. . primer. ND2 

<400> 30 . ... 

gttcttttta agtcctgtgc ate .... 23 



<210> 31 . 

<211> 35 ... . .... ... 

<212>. DNA . 

<213> Artificial Sequence 

<220> 

<223> Description of. Artificial Sequence:. . primer. DD3 

<400> 31 

aatcatgtaa atcataacta tctttaatat actga . . 35 
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